import numpy as np import matplotlib.pyplot as plt from sklearn.datasets import load_iris from sklearn.model_selection import train_test_split from sklearn.linear_model import LogisticRegression from sklearn.metrics import accuracy_score, classification_report
三、数据准备与探索
1. 加载数据集
本文使用经典的鸢尾花(Iris)数据集,包含150条样本,分为3个类别。
1 2 3
iris = load_iris() X = iris.data y = iris.target
2. 数据集划分
将数据集分为训练集和测试集,比例为70%训练,30%测试。
1
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)