{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Setting things up\n", "\n", "## About this notebook\n", "\n", "In this notebook, we prepare the Iris Dataset for multiclass classification using the ``MulticlassCarver`` pipeline. ``MulticlassCarver`` is a Python tool that performs association-maximizing discretization and handles both quantitative and qualitative features. Our objective is to discretize the features so that they discriminate between Iris flower species.\n", "\n", "The Iris Dataset, a classic in machine learning, provides sepal and petal dimensions for three Iris species. All four of its features are quantitative, so this notebook exercises the quantitative side of ``MulticlassCarver``.\n", "\n", "Throughout this notebook, we walk through the steps of ``MulticlassCarver``'s discretization pipeline and show how each feature is discretized for the multiclass classification task.\n", "\n", "We use ``MulticlassCarver`` to preprocess the Iris Dataset. 
The aim is a dataset whose discretized features distinguish between Iris species and can serve as input to multiclass classification models.\n", "\n", "Let's get started.\n", "\n", "\n", "## Installation" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "# %pip install AutoCarver[jupyter]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Iris Data\n", "\n", "In this example notebook, we will use the Iris dataset.\n", "\n", "The Iris dataset is a classic and widely used dataset in machine learning and pattern recognition. It was introduced by the British biologist and statistician Sir Ronald A. Fisher in 1936 and has since become a benchmark for classification and clustering tasks.\n", "\n", "The dataset consists of measurements from 150 iris flowers belonging to three species: setosa, versicolor, and virginica. Four features are recorded for each flower: sepal length, sepal width, petal length, and petal width, all measured in centimeters.\n", "\n", "The dataset is typically used to classify iris flowers into one of the three species based on these four features (multiclass classification)." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| \n", " | sepal length (cm) | \n", "sepal width (cm) | \n", "petal length (cm) | \n", "petal width (cm) | \n", "iris_type | \n", "
|---|---|---|---|---|---|
| 0 | \n", "5.1 | \n", "3.5 | \n", "1.4 | \n", "0.2 | \n", "setosa | \n", "
| 1 | \n", "4.9 | \n", "3.0 | \n", "1.4 | \n", "0.2 | \n", "setosa | \n", "
| 2 | \n", "4.7 | \n", "3.2 | \n", "1.3 | \n", "0.2 | \n", "setosa | \n", "
| 3 | \n", "4.6 | \n", "3.1 | \n", "1.5 | \n", "0.2 | \n", "setosa | \n", "
| 4 | \n", "5.0 | \n", "3.6 | \n", "1.4 | \n", "0.2 | \n", "setosa | \n", "
| \n", " | sepal length (cm) | \n", "sepal width (cm) | \n", "petal length (cm) | \n", "petal width (cm) | \n", "iris_type | \n", "
|---|---|---|---|---|---|
| 136 | \n", "6.3 | \n", "3.4 | \n", "5.6 | \n", "2.4 | \n", "virginica | \n", "
| 17 | \n", "5.1 | \n", "3.5 | \n", "1.4 | \n", "0.3 | \n", "setosa | \n", "
| 142 | \n", "5.8 | \n", "2.7 | \n", "5.1 | \n", "1.9 | \n", "virginica | \n", "
| 59 | \n", "5.2 | \n", "2.7 | \n", "3.9 | \n", "1.4 | \n", "versicolor | \n", "
| 6 | \n", "4.6 | \n", "3.4 | \n", "1.4 | \n", "0.3 | \n", "setosa | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 4.400e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 4.400e+00 < x <= 4.600e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.600e+00 < x <= 4.700e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 4.700e+00 < x <= 4.800e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 4.800e+00 < x <= 4.900e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.900e+00 < x <= 5.000e+00 | \n", "0.1429 | \n", "0.0700 | \n", "7 | \n", "
| 5.000e+00 < x <= 5.100e+00 | \n", "0.1667 | \n", "0.0600 | \n", "6 | \n", "
| 5.100e+00 < x <= 5.200e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 5.200e+00 < x <= 5.300e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 5.300e+00 < x <= 5.400e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.400e+00 < x <= 5.500e+00 | \n", "0.6667 | \n", "0.0600 | \n", "6 | \n", "
| 5.500e+00 < x <= 5.600e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 5.600e+00 < x <= 5.700e+00 | \n", "0.5000 | \n", "0.0400 | \n", "4 | \n", "
| 5.700e+00 < x <= 5.800e+00 | \n", "0.4000 | \n", "0.0500 | \n", "5 | \n", "
| 5.800e+00 < x <= 5.900e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 5.900e+00 < x <= 6.000e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 6.000e+00 < x <= 6.100e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 6.100e+00 < x <= 6.200e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 6.200e+00 < x <= 6.300e+00 | \n", "0.2857 | \n", "0.0700 | \n", "7 | \n", "
| 6.300e+00 < x <= 6.400e+00 | \n", "0.2500 | \n", "0.0400 | \n", "4 | \n", "
| 6.400e+00 < x <= 6.500e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 6.500e+00 < x <= 6.700e+00 | \n", "0.6667 | \n", "0.0600 | \n", "6 | \n", "
| 6.700e+00 < x <= 6.800e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 6.800e+00 < x <= 6.900e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 6.900e+00 < x <= 7.100e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 7.100e+00 < x <= 7.200e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 7.200e+00 < x <= 7.600e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 7.600e+00 < x <= 7.700e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 7.700e+00 < x <= 7.900e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 7.900e+00 < x | \n", "nan | \n", "0.0000 | \n", "0 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.7500 | \n", "0.0800 | \n", "4 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.6667 | \n", "0.0600 | \n", "3 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.2500 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 5.40e+00 | \n", "0.0882 | \n", "0.3400 | \n", "34 | \n", "
| 5.40e+00 < x <= 7.10e+00 | \n", "0.5263 | \n", "0.5700 | \n", "57 | \n", "
| 7.10e+00 < x | \n", "0.0000 | \n", "0.0900 | \n", "9 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.1667 | \n", "0.3600 | \n", "18 | \n", "
| 0.4667 | \n", "0.6000 | \n", "30 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 2.000e+00 | \n", "1.0000 | \n", "0.0100 | \n", "1 | \n", "
| 2.000e+00 < x <= 2.200e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 2.200e+00 < x <= 2.400e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 2.400e+00 < x <= 2.500e+00 | \n", "0.6000 | \n", "0.0500 | \n", "5 | \n", "
| 2.500e+00 < x <= 2.600e+00 | \n", "0.7500 | \n", "0.0400 | \n", "4 | \n", "
| 2.600e+00 < x <= 2.700e+00 | \n", "0.5714 | \n", "0.0700 | \n", "7 | \n", "
| 2.700e+00 < x <= 2.800e+00 | \n", "0.4444 | \n", "0.0900 | \n", "9 | \n", "
| 2.800e+00 < x <= 2.900e+00 | \n", "0.6667 | \n", "0.0600 | \n", "6 | \n", "
| 2.900e+00 < x <= 3.000e+00 | \n", "0.2857 | \n", "0.1400 | \n", "14 | \n", "
| 3.000e+00 < x <= 3.100e+00 | \n", "0.3333 | \n", "0.0900 | \n", "9 | \n", "
| 3.100e+00 < x <= 3.200e+00 | \n", "0.2222 | \n", "0.0900 | \n", "9 | \n", "
| 3.200e+00 < x <= 3.300e+00 | \n", "0.0000 | \n", "0.0400 | \n", "4 | \n", "
| 3.300e+00 < x <= 3.400e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 3.400e+00 < x <= 3.500e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 3.500e+00 < x <= 3.600e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 3.600e+00 < x <= 3.700e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 3.700e+00 < x <= 3.800e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 3.800e+00 < x <= 4.100e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.100e+00 < x | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.7500 | \n", "0.0800 | \n", "4 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.4000 | \n", "0.1000 | \n", "5 | \n", "
| 0.7500 | \n", "0.0800 | \n", "4 | \n", "
| 0.3333 | \n", "0.2400 | \n", "12 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.2500 | \n", "0.0800 | \n", "4 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.1667 | \n", "0.1200 | \n", "6 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 2.9e+00 | \n", "0.6316 | \n", "0.3800 | \n", "38 | \n", "
| 2.9e+00 < x | \n", "0.1452 | \n", "0.6200 | \n", "62 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.5263 | \n", "0.3800 | \n", "19 | \n", "
| 0.2258 | \n", "0.6200 | \n", "31 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.100e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 1.100e+00 < x <= 1.300e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 1.300e+00 < x <= 1.400e+00 | \n", "0.0000 | \n", "0.1100 | \n", "11 | \n", "
| 1.400e+00 < x <= 1.500e+00 | \n", "0.0000 | \n", "0.0900 | \n", "9 | \n", "
| 1.500e+00 < x <= 1.600e+00 | \n", "0.0000 | \n", "0.0700 | \n", "7 | \n", "
| 1.600e+00 < x <= 1.900e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 1.900e+00 < x <= 3.500e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 3.500e+00 < x <= 3.700e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 3.700e+00 < x <= 4.000e+00 | \n", "1.0000 | \n", "0.0700 | \n", "7 | \n", "
| 4.000e+00 < x <= 4.200e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.200e+00 < x <= 4.300e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 4.300e+00 < x <= 4.400e+00 | \n", "1.0000 | \n", "0.0400 | \n", "4 | \n", "
| 4.400e+00 < x <= 4.500e+00 | \n", "1.0000 | \n", "0.0100 | \n", "1 | \n", "
| 4.500e+00 < x <= 4.600e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.600e+00 < x <= 4.700e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.700e+00 < x <= 4.800e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 4.800e+00 < x <= 4.900e+00 | \n", "0.5000 | \n", "0.0400 | \n", "4 | \n", "
| 4.900e+00 < x <= 5.000e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.000e+00 < x <= 5.100e+00 | \n", "0.1667 | \n", "0.0600 | \n", "6 | \n", "
| 5.100e+00 < x <= 5.400e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 5.400e+00 < x <= 5.600e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 5.600e+00 < x <= 5.700e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.700e+00 < x <= 5.900e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.900e+00 < x <= 6.100e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 6.100e+00 < x <= 6.600e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 6.600e+00 < x | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.1200 | \n", "6 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 1.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.8571 | \n", "0.1400 | \n", "7 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.90e+00 | \n", "0.0000 | \n", "0.3400 | \n", "34 | \n", "
| 1.90e+00 < x <= 4.80e+00 | \n", "0.9677 | \n", "0.3100 | \n", "31 | \n", "
| 4.80e+00 < x | \n", "0.0857 | \n", "0.3500 | \n", "35 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.3200 | \n", "16 | \n", "
| 0.8889 | \n", "0.3600 | \n", "18 | \n", "
| 0.0625 | \n", "0.3200 | \n", "16 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.000e-01 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 1.000e-01 < x <= 2.000e-01 | \n", "0.0000 | \n", "0.1700 | \n", "17 | \n", "
| 2.000e-01 < x <= 3.000e-01 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 3.000e-01 < x <= 4.000e-01 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 4.000e-01 < x <= 6.000e-01 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 6.000e-01 < x <= 1.000e+00 | \n", "1.0000 | \n", "0.0400 | \n", "4 | \n", "
| 1.000e+00 < x <= 1.100e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 1.100e+00 < x <= 1.200e+00 | \n", "1.0000 | \n", "0.0500 | \n", "5 | \n", "
| 1.200e+00 < x <= 1.300e+00 | \n", "1.0000 | \n", "0.0800 | \n", "8 | \n", "
| 1.300e+00 < x <= 1.400e+00 | \n", "1.0000 | \n", "0.0600 | \n", "6 | \n", "
| 1.400e+00 < x <= 1.500e+00 | \n", "0.8571 | \n", "0.0700 | \n", "7 | \n", "
| 1.500e+00 < x <= 1.600e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 1.600e+00 < x <= 1.800e+00 | \n", "0.1429 | \n", "0.0700 | \n", "7 | \n", "
| 1.800e+00 < x <= 1.900e+00 | \n", "0.0000 | \n", "0.0400 | \n", "4 | \n", "
| 1.900e+00 < x <= 2.000e+00 | \n", "0.0000 | \n", "0.0400 | \n", "4 | \n", "
| 2.000e+00 < x <= 2.100e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 2.100e+00 < x <= 2.200e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 2.200e+00 < x <= 2.300e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 2.300e+00 < x <= 2.400e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 2.400e+00 < x <= 2.500e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 2.500e+00 < x | \n", "nan | \n", "0.0000 | \n", "0 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.2400 | \n", "12 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.1000 | \n", "5 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.8000 | \n", "0.1000 | \n", "5 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.1429 | \n", "0.1400 | \n", "7 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 6.00e-01 | \n", "0.0000 | \n", "0.3400 | \n", "34 | \n", "
| 6.00e-01 < x <= 1.60e+00 | \n", "0.9412 | \n", "0.3400 | \n", "34 | \n", "
| 1.60e+00 < x | \n", "0.0312 | \n", "0.3200 | \n", "32 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.3200 | \n", "16 | \n", "
| 0.8889 | \n", "0.3600 | \n", "18 | \n", "
| 0.0625 | \n", "0.3200 | \n", "16 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 4.400e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 4.400e+00 < x <= 4.600e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.600e+00 < x <= 4.700e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 4.700e+00 < x <= 4.800e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 4.800e+00 < x <= 4.900e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.900e+00 < x <= 5.000e+00 | \n", "0.0000 | \n", "0.0700 | \n", "7 | \n", "
| 5.000e+00 < x <= 5.100e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 5.100e+00 < x <= 5.200e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.200e+00 < x <= 5.300e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 5.300e+00 < x <= 5.400e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.400e+00 < x <= 5.500e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 5.500e+00 < x <= 5.600e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 5.600e+00 < x <= 5.700e+00 | \n", "0.2500 | \n", "0.0400 | \n", "4 | \n", "
| 5.700e+00 < x <= 5.800e+00 | \n", "0.6000 | \n", "0.0500 | \n", "5 | \n", "
| 5.800e+00 < x <= 5.900e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 5.900e+00 < x <= 6.000e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 6.000e+00 < x <= 6.100e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 6.100e+00 < x <= 6.200e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 6.200e+00 < x <= 6.300e+00 | \n", "0.7143 | \n", "0.0700 | \n", "7 | \n", "
| 6.300e+00 < x <= 6.400e+00 | \n", "0.7500 | \n", "0.0400 | \n", "4 | \n", "
| 6.400e+00 < x <= 6.500e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 6.500e+00 < x <= 6.700e+00 | \n", "0.3333 | \n", "0.0600 | \n", "6 | \n", "
| 6.700e+00 < x <= 6.800e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 6.800e+00 < x <= 6.900e+00 | \n", "0.6667 | \n", "0.0300 | \n", "3 | \n", "
| 6.900e+00 < x <= 7.100e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 7.100e+00 < x <= 7.200e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 7.200e+00 < x <= 7.600e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 7.600e+00 < x <= 7.700e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 7.700e+00 < x <= 7.900e+00 | \n", "1.0000 | \n", "0.0100 | \n", "1 | \n", "
| 7.900e+00 < x | \n", "nan | \n", "0.0000 | \n", "0 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.3333 | \n", "0.0600 | \n", "3 | \n", "
| 0.6667 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.6667 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.7500 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 6.2e+00 | \n", "0.1250 | \n", "0.6400 | \n", "64 | \n", "
| 6.2e+00 < x | \n", "0.6944 | \n", "0.3600 | \n", "36 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.1429 | \n", "0.7000 | \n", "35 | \n", "
| 0.8000 | \n", "0.3000 | \n", "15 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 2.000e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 2.000e+00 < x <= 2.200e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 2.200e+00 < x <= 2.400e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 2.400e+00 < x <= 2.500e+00 | \n", "0.4000 | \n", "0.0500 | \n", "5 | \n", "
| 2.500e+00 < x <= 2.600e+00 | \n", "0.2500 | \n", "0.0400 | \n", "4 | \n", "
| 2.600e+00 < x <= 2.700e+00 | \n", "0.4286 | \n", "0.0700 | \n", "7 | \n", "
| 2.700e+00 < x <= 2.800e+00 | \n", "0.5556 | \n", "0.0900 | \n", "9 | \n", "
| 2.800e+00 < x <= 2.900e+00 | \n", "0.1667 | \n", "0.0600 | \n", "6 | \n", "
| 2.900e+00 < x <= 3.000e+00 | \n", "0.4286 | \n", "0.1400 | \n", "14 | \n", "
| 3.000e+00 < x <= 3.100e+00 | \n", "0.2222 | \n", "0.0900 | \n", "9 | \n", "
| 3.100e+00 < x <= 3.200e+00 | \n", "0.5556 | \n", "0.0900 | \n", "9 | \n", "
| 3.200e+00 < x <= 3.300e+00 | \n", "0.7500 | \n", "0.0400 | \n", "4 | \n", "
| 3.300e+00 < x <= 3.400e+00 | \n", "0.1667 | \n", "0.0600 | \n", "6 | \n", "
| 3.400e+00 < x <= 3.500e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 3.500e+00 < x <= 3.600e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 3.600e+00 < x <= 3.700e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 3.700e+00 < x <= 3.800e+00 | \n", "0.4000 | \n", "0.0500 | \n", "5 | \n", "
| 3.800e+00 < x <= 4.100e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.100e+00 < x | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| 0.6667 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.6000 | \n", "0.1000 | \n", "5 | \n", "
| 0.2500 | \n", "0.0800 | \n", "4 | \n", "
| 0.5000 | \n", "0.2400 | \n", "12 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.1667 | \n", "0.1200 | \n", "6 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 2.40e+00 | \n", "0.1429 | \n", "0.0700 | \n", "7 | \n", "
| 2.40e+00 < x <= 3.30e+00 | \n", "0.4179 | \n", "0.6700 | \n", "67 | \n", "
| 3.30e+00 < x | \n", "0.1538 | \n", "0.2600 | \n", "26 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| 0.4571 | \n", "0.7000 | \n", "35 | \n", "
| 0.0909 | \n", "0.2200 | \n", "11 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.100e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 1.100e+00 < x <= 1.300e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 1.300e+00 < x <= 1.400e+00 | \n", "0.0000 | \n", "0.1100 | \n", "11 | \n", "
| 1.400e+00 < x <= 1.500e+00 | \n", "0.0000 | \n", "0.0900 | \n", "9 | \n", "
| 1.500e+00 < x <= 1.600e+00 | \n", "0.0000 | \n", "0.0700 | \n", "7 | \n", "
| 1.600e+00 < x <= 1.900e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 1.900e+00 < x <= 3.500e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 3.500e+00 < x <= 3.700e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 3.700e+00 < x <= 4.000e+00 | \n", "0.0000 | \n", "0.0700 | \n", "7 | \n", "
| 4.000e+00 < x <= 4.200e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.200e+00 < x <= 4.300e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 4.300e+00 < x <= 4.400e+00 | \n", "0.0000 | \n", "0.0400 | \n", "4 | \n", "
| 4.400e+00 < x <= 4.500e+00 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 4.500e+00 < x <= 4.600e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.600e+00 < x <= 4.700e+00 | \n", "0.0000 | \n", "0.0300 | \n", "3 | \n", "
| 4.700e+00 < x <= 4.800e+00 | \n", "0.3333 | \n", "0.0300 | \n", "3 | \n", "
| 4.800e+00 < x <= 4.900e+00 | \n", "0.5000 | \n", "0.0400 | \n", "4 | \n", "
| 4.900e+00 < x <= 5.000e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.000e+00 < x <= 5.100e+00 | \n", "0.8333 | \n", "0.0600 | \n", "6 | \n", "
| 5.100e+00 < x <= 5.400e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 5.400e+00 < x <= 5.600e+00 | \n", "1.0000 | \n", "0.0500 | \n", "5 | \n", "
| 5.600e+00 < x <= 5.700e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.700e+00 < x <= 5.900e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 5.900e+00 < x <= 6.100e+00 | \n", "1.0000 | \n", "0.0500 | \n", "5 | \n", "
| 6.100e+00 < x <= 6.600e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 6.600e+00 < x | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.1200 | \n", "6 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.1429 | \n", "0.1400 | \n", "7 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 1.0000 | \n", "0.0800 | \n", "4 | \n", "
| 1.0000 | \n", "0.0800 | \n", "4 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 4.8e+00 | \n", "0.0154 | \n", "0.6500 | \n", "65 | \n", "
| 4.8e+00 < x | \n", "0.9143 | \n", "0.3500 | \n", "35 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0588 | \n", "0.6800 | \n", "34 | \n", "
| 0.9375 | \n", "0.3200 | \n", "16 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.000e-01 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 1.000e-01 < x <= 2.000e-01 | \n", "0.0000 | \n", "0.1700 | \n", "17 | \n", "
| 2.000e-01 < x <= 3.000e-01 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 3.000e-01 < x <= 4.000e-01 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 4.000e-01 < x <= 6.000e-01 | \n", "0.0000 | \n", "0.0100 | \n", "1 | \n", "
| 6.000e-01 < x <= 1.000e+00 | \n", "0.0000 | \n", "0.0400 | \n", "4 | \n", "
| 1.000e+00 < x <= 1.100e+00 | \n", "0.0000 | \n", "0.0200 | \n", "2 | \n", "
| 1.100e+00 < x <= 1.200e+00 | \n", "0.0000 | \n", "0.0500 | \n", "5 | \n", "
| 1.200e+00 < x <= 1.300e+00 | \n", "0.0000 | \n", "0.0800 | \n", "8 | \n", "
| 1.300e+00 < x <= 1.400e+00 | \n", "0.0000 | \n", "0.0600 | \n", "6 | \n", "
| 1.400e+00 < x <= 1.500e+00 | \n", "0.1429 | \n", "0.0700 | \n", "7 | \n", "
| 1.500e+00 < x <= 1.600e+00 | \n", "0.5000 | \n", "0.0200 | \n", "2 | \n", "
| 1.600e+00 < x <= 1.800e+00 | \n", "0.8571 | \n", "0.0700 | \n", "7 | \n", "
| 1.800e+00 < x <= 1.900e+00 | \n", "1.0000 | \n", "0.0400 | \n", "4 | \n", "
| 1.900e+00 < x <= 2.000e+00 | \n", "1.0000 | \n", "0.0400 | \n", "4 | \n", "
| 2.000e+00 < x <= 2.100e+00 | \n", "1.0000 | \n", "0.0600 | \n", "6 | \n", "
| 2.100e+00 < x <= 2.200e+00 | \n", "1.0000 | \n", "0.0100 | \n", "1 | \n", "
| 2.200e+00 < x <= 2.300e+00 | \n", "1.0000 | \n", "0.0500 | \n", "5 | \n", "
| 2.300e+00 < x <= 2.400e+00 | \n", "1.0000 | \n", "0.0200 | \n", "2 | \n", "
| 2.400e+00 < x <= 2.500e+00 | \n", "1.0000 | \n", "0.0300 | \n", "3 | \n", "
| 2.500e+00 < x | \n", "nan | \n", "0.0000 | \n", "0 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.2400 | \n", "12 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| 0.0000 | \n", "0.0600 | \n", "3 | \n", "
| 0.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 0.0000 | \n", "0.1000 | \n", "5 | \n", "
| 0.5000 | \n", "0.0400 | \n", "2 | \n", "
| 0.2000 | \n", "0.1000 | \n", "5 | \n", "
| 0.0000 | \n", "0.0400 | \n", "2 | \n", "
| 0.8571 | \n", "0.1400 | \n", "7 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| 1.0000 | \n", "0.0400 | \n", "2 | \n", "
| 1.0000 | \n", "0.0600 | \n", "3 | \n", "
| 1.0000 | \n", "0.0200 | \n", "1 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| nan | \n", "0.0000 | \n", "0 | \n", "
| \n", " | target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|---|
| x <= 1.5e+00 | \n", "0.0152 | \n", "0.6600 | \n", "66 | \n", "
| 1.5e+00 < x | \n", "0.9412 | \n", "0.3400 | \n", "34 | \n", "
| target_mean | \n", "frequency | \n", "count | \n", "
|---|---|---|
| 0.0625 | \n", "0.6400 | \n", "32 | \n", "
| 0.8333 | \n", "0.3600 | \n", "18 | \n", "
| feature | \n", "count | \n", "cramerv | \n", "tschuprowt | \n", "n_mod | \n", "label | \n", "content | \n", "target_mean | \n", "frequency | \n", "dropped | \n", "dropped_reason | \n", "
|---|---|---|---|---|---|---|---|---|---|---|
| Quantitative('sepal length (cm)__y=versicolor') | \n", "34.0 | \n", "0.483288 | \n", "0.406395 | \n", "3 | \n", "0 | \n", "x <= 5.40e+00 | \n", "0.088235 | \n", "0.34 | \n", "False | \n", "None | \n", "
| 57.0 | \n", "0.483288 | \n", "0.406395 | \n", "3 | \n", "1 | \n", "5.40e+00 < x <= 7.10e+00 | \n", "0.526316 | \n", "0.57 | \n", "False | \n", "None | \n", "|
| 9.0 | \n", "0.483288 | \n", "0.406395 | \n", "3 | \n", "2 | \n", "7.10e+00 < x | \n", "0.000000 | \n", "0.09 | \n", "False | \n", "None | \n", "|
| Quantitative('sepal width (cm)__y=versicolor') | \n", "38.0 | \n", "0.480207 | \n", "0.480207 | \n", "2 | \n", "0 | \n", "x <= 2.9e+00 | \n", "0.631579 | \n", "0.38 | \n", "False | \n", "None | \n", "
| 62.0 | \n", "0.480207 | \n", "0.480207 | \n", "2 | \n", "1 | \n", "2.9e+00 < x | \n", "0.145161 | \n", "0.62 | \n", "False | \n", "None | \n", "|
| Quantitative('petal length (cm)__y=versicolor') | \n", "34.0 | \n", "0.912237 | \n", "0.767096 | \n", "3 | \n", "0 | \n", "x <= 1.90e+00 | \n", "0.000000 | \n", "0.34 | \n", "False | \n", "None | \n", "
| 31.0 | \n", "0.912237 | \n", "0.767096 | \n", "3 | \n", "1 | \n", "1.90e+00 < x <= 4.80e+00 | \n", "0.967742 | \n", "0.31 | \n", "False | \n", "None | \n", "|
| 35.0 | \n", "0.912237 | \n", "0.767096 | \n", "3 | \n", "2 | \n", "4.80e+00 < x | \n", "0.085714 | \n", "0.35 | \n", "False | \n", "None | \n", "|
| Quantitative('petal width (cm)__y=versicolor') | \n", "34.0 | \n", "0.933300 | \n", "0.784809 | \n", "3 | \n", "0 | \n", "x <= 6.00e-01 | \n", "0.000000 | \n", "0.34 | \n", "False | \n", "None | \n", "
| 1 | \n", "6.00e-01 < x <= 1.60e+00 | \n", "0.941176 | \n", "0.34 | \n", "False | \n", "None | \n", "|||||
| 32.0 | \n", "0.933300 | \n", "0.784809 | \n", "3 | \n", "2 | \n", "1.60e+00 < x | \n", "0.031250 | \n", "0.32 | \n", "False | \n", "None | \n", "|
| Quantitative('sepal length (cm)__y=virginica') | \n", "64.0 | \n", "0.559144 | \n", "0.559144 | \n", "2 | \n", "0 | \n", "x <= 6.2e+00 | \n", "0.125000 | \n", "0.64 | \n", "False | \n", "None | \n", "
| 36.0 | \n", "0.559144 | \n", "0.559144 | \n", "2 | \n", "1 | \n", "6.2e+00 < x | \n", "0.694444 | \n", "0.36 | \n", "False | \n", "None | \n", "|
| Quantitative('sepal width (cm)__y=virginica') | \n", "7.0 | \n", "0.266452 | \n", "0.224058 | \n", "3 | \n", "0 | \n", "x <= 2.40e+00 | \n", "0.142857 | \n", "0.07 | \n", "False | \n", "None | \n", "
| 67.0 | \n", "0.266452 | \n", "0.224058 | \n", "3 | \n", "1 | \n", "2.40e+00 < x <= 3.30e+00 | \n", "0.417910 | \n", "0.67 | \n", "False | \n", "None | \n", "|
| 26.0 | \n", "0.266452 | \n", "0.224058 | \n", "3 | \n", "2 | \n", "3.30e+00 < x | \n", "0.153846 | \n", "0.26 | \n", "False | \n", "None | \n", "|
| Quantitative('petal length (cm)__y=virginica') | \n", "65.0 | \n", "0.889524 | \n", "0.889524 | \n", "2 | \n", "0 | \n", "x <= 4.8e+00 | \n", "0.015385 | \n", "0.65 | \n", "False | \n", "None | \n", "
| 35.0 | \n", "0.889524 | \n", "0.889524 | \n", "2 | \n", "1 | \n", "4.8e+00 < x | \n", "0.914286 | \n", "0.35 | \n", "False | \n", "None | \n", "|
| Quantitative('petal width (cm)__y=virginica') | \n", "66.0 | \n", "0.910463 | \n", "0.910463 | \n", "2 | \n", "0 | \n", "x <= 1.5e+00 | \n", "0.015152 | \n", "0.66 | \n", "False | \n", "None | \n", "
| 34.0 | \n", "0.910463 | \n", "0.910463 | \n", "2 | \n", "1 | \n", "1.5e+00 < x | \n", "0.941176 | \n", "0.34 | \n", "False | \n", "None | \n", "
|  | info | cramerv | tschuprowt | combination | n_mod | dropna | train | viable | dev |
|---|---|---|---|---|---|---|---|---|---|
| 0 | Raw distribution (n_mod=30>max_n_mod=5) | 0.590426 | 0.254429 | {'x <= 4.400e+00': 'x <= 4.400e+00', '4.400e+0... | 30 | False | NaN | NaN | NaN |
| 1 | Best for tschuprowt and max_n_mod=5 | 0.483288 | 0.406395 | {'x <= 4.400e+00': 'x <= 4.400e+00', '4.400e+0... | 3 | False | {'viable': True, 'info': ''} | True | {'viable': True, 'info': ''} |
|  | sepal length (cm)__y=versicolor | sepal width (cm)__y=versicolor | petal length (cm)__y=versicolor | petal width (cm)__y=versicolor | sepal length (cm)__y=virginica | sepal width (cm)__y=virginica | petal length (cm)__y=virginica | petal width (cm)__y=virginica |
|---|---|---|---|---|---|---|---|---|
| 0.0 | 0.36 | 0.38 | 0.32 | 0.32 | 0.7 | 0.08 | 0.68 | 0.64 |
| 1.0 | 0.60 | 0.62 | 0.36 | 0.36 | 0.3 | 0.70 | 0.32 | 0.36 |
| 2.0 | 0.04 | NaN | 0.32 | 0.32 | NaN | 0.22 | NaN | NaN |
|  | feature | Nan | Mode | TschuprowtMeasure | TschuprowtRank | TschuprowtFilter | TschuprowtWith |
|---|---|---|---|---|---|---|---|
| 3 | Quantitative('petal width (cm)__y=versicolor') | 0.0000 | 0.3400 | 0.9558 | 0.0000 | 0.0000 | itself |
| 2 | Quantitative('petal length (cm)__y=versicolor') | 0.0000 | 0.3500 | 0.9421 | 1.0000 | 0.9018 | petal width (cm)__y=versicolor |
| 7 | Quantitative('petal width (cm)__y=virginica') | 0.0000 | 0.6600 | 0.7857 | 2.0000 | 0.8049 | petal width (cm)__y=versicolor |
| 6 | Quantitative('petal length (cm)__y=virginica') | 0.0000 | 0.6500 | 0.7695 | 3.0000 | 0.8675 | petal width (cm)__y=virginica |
| 0 | Quantitative('sepal length (cm)__y=versicolor') | 0.0000 | 0.5700 | 0.6713 | 4.0000 | 0.6649 | petal length (cm)__y=versicolor |
| 4 | Quantitative('sepal length (cm)__y=virginica') | 0.0000 | 0.6400 | 0.5441 | 5.0000 | 0.6071 | petal length (cm)__y=virginica |
| 1 | Quantitative('sepal width (cm)__y=versicolor') | 0.0000 | 0.6200 | 0.4950 | 6.0000 | 0.5042 | petal width (cm)__y=versicolor |
| 5 | Quantitative('sepal width (cm)__y=virginica') | 0.0000 | 0.6700 | 0.4868 | 7.0000 | 0.5107 | petal width (cm)__y=versicolor |
|  | petal width (cm)__y=versicolor | petal length (cm)__y=versicolor | petal width (cm)__y=virginica | petal length (cm)__y=virginica |
|---|---|---|---|---|
| 136 | 2.0 | 2.0 | 1.0 | 1.0 |
| 17 | 0.0 | 0.0 | 0.0 | 0.0 |
| 142 | 2.0 | 2.0 | 1.0 | 1.0 |
| 59 | 1.0 | 1.0 | 0.0 | 0.0 |
| 6 | 0.0 | 0.0 | 0.0 | 0.0 |
XGBClassifier(base_score=None, booster=None, callbacks=None,
              colsample_bylevel=None, colsample_bynode=None,
              colsample_bytree=None, device=None, early_stopping_rounds=None,
              enable_categorical=False, eval_metric=None, feature_types=None,
              feature_weights=None, gamma=None, grow_policy=None,
              importance_type=None, interaction_constraints=None,
              learning_rate=None, max_bin=None, max_cat_threshold=None,
              max_cat_to_onehot=None, max_delta_step=None, max_depth=None,
              max_leaves=None, min_child_weight=None, missing=nan,
              monotone_constraints=None, multi_strategy=None, n_estimators=None,
              n_jobs=None, num_parallel_tree=None, ...)