[Artificial Intelligence (AI)] Maximizing Crop Yields with Satellite Data and Python

November 12, 2023 / January 25, 2024

Working with ChatGPT

This post walks through the steps for using artificial intelligence (AI), satellite data, and Python to maximize crop yields.

Generating sample data with Python

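First, we generate sample data. The data imitates satellite-derived data and contains the crop type, weather conditions, soil condition, and yield. Note that in this synthetic data the yield is computed from soil quality plus random noise only; crop type and weather are drawn independently and have no effect on it.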

import pandas as pd
import numpy as np

# Generate the data (fixed seed so the results are reproducible)
np.random.seed(0)
num_samples = 1000
crops = np.random.choice(['wheat', 'rice', 'corn'], num_samples)
weather = np.random.choice(['sunny', 'cloudy', 'rainy'], num_samples)
soil_quality = np.random.normal(50, 10, num_samples)
# Yield = soil quality scaled by a random factor in [0.8, 1.2), plus Gaussian noise
yield_amount = soil_quality * np.random.uniform(0.8, 1.2, num_samples) + np.random.normal(0, 5, num_samples)

# Build the DataFrame
df = pd.DataFrame({
    'Crop': crops,
    'Weather': weather,
    'SoilQuality': soil_quality,
    'Yield': yield_amount
})
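
Before plotting anything, it can help to check that the generated table looks the way the description says. The following is a minimal sketch, not part of the original walkthrough: it rebuilds the same data (same seed and order of random draws) and summarizes the yield per crop. The name `per_crop` is introduced here for illustration only.

```python
import numpy as np
import pandas as pd

# Rebuild the synthetic data with the same seed and draw order as above
np.random.seed(0)
num_samples = 1000
df = pd.DataFrame({
    'Crop': np.random.choice(['wheat', 'rice', 'corn'], num_samples),
    'Weather': np.random.choice(['sunny', 'cloudy', 'rainy'], num_samples),
    'SoilQuality': np.random.normal(50, 10, num_samples),
})
df['Yield'] = (df['SoilQuality'] * np.random.uniform(0.8, 1.2, num_samples)
               + np.random.normal(0, 5, num_samples))

# First rows and basic statistics of the numeric columns
print(df.head())
print(df.describe())

# Per-crop summary (hypothetical helper name): sample count, mean, and std of Yield
per_crop = df.groupby('Crop')['Yield'].agg(['count', 'mean', 'std'])
print(per_crop)
```

Because the crop label is drawn independently of the yield, the per-crop means should come out close to one another. With real satellite-derived data you would expect them to differ.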

Visualizing the data

Next, we visualize the data to understand its characteristics.

import matplotlib.pyplot as plt
import seaborn as sns

# Distribution of yield
plt.figure(figsize=(10, 6))
sns.histplot(df['Yield'], kde=True)
plt.title('Yield Distribution')
plt.xlabel('Yield')
plt.ylabel('Frequency')
plt.show()

# Relationship between soil quality and yield, colored by crop type
plt.figure(figsize=(10, 6))
sns.scatterplot(x='SoilQuality', y='Yield', hue='Crop', data=df)
plt.title('Soil Quality vs Yield by Crop Type')
plt.xlabel('Soil Quality')
plt.ylabel('Yield')
plt.show()
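
The relationship shown in the scatter plot can also be put into numbers. The sketch below is an addition to the original steps: it rebuilds the same synthetic data, then computes the Pearson correlation and a straight-line least-squares fit with `np.polyfit`. The variable names `corr`, `slope`, and `intercept` are chosen here for illustration.

```python
import numpy as np
import pandas as pd

# Rebuild the synthetic data with the same seed and draw order as the generation step
np.random.seed(0)
num_samples = 1000
df = pd.DataFrame({
    'Crop': np.random.choice(['wheat', 'rice', 'corn'], num_samples),
    'Weather': np.random.choice(['sunny', 'cloudy', 'rainy'], num_samples),
    'SoilQuality': np.random.normal(50, 10, num_samples),
})
df['Yield'] = (df['SoilQuality'] * np.random.uniform(0.8, 1.2, num_samples)
               + np.random.normal(0, 5, num_samples))

# Pearson correlation between soil quality and yield
corr = df['SoilQuality'].corr(df['Yield'])

# Degree-1 least-squares fit: Yield ≈ slope * SoilQuality + intercept
slope, intercept = np.polyfit(df['SoilQuality'], df['Yield'], 1)

print(f"Correlation: {corr:.3f}")
print(f"Fitted line: Yield = {slope:.3f} * SoilQuality + {intercept:.3f}")
```

Since the multiplier applied to soil quality averages 1.0, the fitted slope should land near 1. The spread around the line comes from the random multiplier and the added noise.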

Building a prediction model

We now build a model to predict yield. Here we use a random forest regression model.

from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error

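# Prepare the data: one-hot encode the categorical columns (Crop, Weather).
# drop_first=True drops one level per column to avoid redundant dummy columns.
df_dummy = pd.get_dummies(df, drop_first=True)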
X = df_dummy.drop('Yield', axis=1)
y = df_dummy['Yield']

# Split the data: 80% for training, 20% for testing
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Build and train the model
model = RandomForestRegressor(n_estimators=100, random_state=42)
model.fit(X_train, y_train)

# Evaluate performance on the held-out test set
predictions = model.predict(X_test)
mse = mean_squared_error(y_test, predictions)
print(f"Mean Squared Error: {mse}")
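
A raw mean squared error is hard to interpret on its own. As a hedged sketch that goes slightly beyond the original evaluation, the block below repeats the pipeline end to end and adds three things: RMSE (in the same units as yield), the R² score, and a comparison against a naive baseline that always predicts the training-set mean. It also lists the model's feature importances. The names `baseline_rmse` and `importances` are introduced here for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Rebuild the synthetic data with the same seed and draw order as the generation step
np.random.seed(0)
num_samples = 1000
df = pd.DataFrame({
    'Crop': np.random.choice(['wheat', 'rice', 'corn'], num_samples),
    'Weather': np.random.choice(['sunny', 'cloudy', 'rainy'], num_samples),
    'SoilQuality': np.random.normal(50, 10, num_samples),
})
df['Yield'] = (df['SoilQuality'] * np.random.uniform(0.8, 1.2, num_samples)
               + np.random.normal(0, 5, num_samples))

# Same preparation, split, and model settings as in the walkthrough
X = pd.get_dummies(df, drop_first=True).drop('Yield', axis=1)
y = df['Yield']
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)
model = RandomForestRegressor(n_estimators=100, random_state=42)
model.fit(X_train, y_train)
predictions = model.predict(X_test)

# RMSE is the square root of MSE, so it is in the same units as Yield
rmse = np.sqrt(mean_squared_error(y_test, predictions))
r2 = r2_score(y_test, predictions)

# Naive baseline: always predict the mean yield of the training set
baseline = np.full(len(y_test), y_train.mean())
baseline_rmse = np.sqrt(mean_squared_error(y_test, baseline))

# Impurity-based feature importances, largest first
importances = pd.Series(
    model.feature_importances_, index=X.columns).sort_values(ascending=False)

print(f"RMSE: {rmse:.2f} (baseline: {baseline_rmse:.2f})")
print(f"R^2: {r2:.3f}")
print(importances)
```

With this synthetic data, SoilQuality should dominate the importances, because crop type and weather were generated independently of the yield. On real data, the same check is a quick way to see which inputs the model actually relies on.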