AGRARIAN/vineyards-yield-dataset
Viewer • Updated • 926 • 6
This project demonstrates machine learning models to predict vines yield at harvest time using remote sensing data, weather information, soil properties, and agronomic attributes.
The models predict:
training_features_anonymized.csv - Dataset for model training (926 rows × 589 columns)BuildModels_open_source.ipynb - Self-contained notebook for training prediction modelsThis notebook trains machine learning models to predict vines yield (TCH) at harvest time.
training_features_anonymized.csv (926 harvest observations)Load and prepare data
Train models using Leave-One-Season-Out Cross-Validation
Evaluate performance
Save final models
.joblib files for future usetch_model.joblib - Yield prediction modeltch_encoders.joblib - Label encoders for categorical variablesWeather + Soil + Extra:
numpy>=1.21.0
pandas>=1.3.0
scikit-learn>=1.0.0
lightgbm>=3.3.0
matplotlib>=3.4.0
seaborn>=0.11.0
jupyter>=1.0.0
joblib>=1.0.0
# Install dependencies
pip install numpy pandas scikit-learn lightgbm matplotlib seaborn jupyter joblib
# Navigate to the notebook directory
cd open_source_model/
# Launch Jupyter
jupyter notebook
# Open BuildModels_open_source.ipynb and run all cells
import joblib
import pandas as pd
# Load models and encoders
tch_model = joblib.load('tch_model.joblib')
tch_encoders = joblib.load('tch_encoders.joblib')
# Prepare your data (must have the same features)
# X = pd.DataFrame(...) # Your feature data
# Make predictions
tch_predictions = tch_model.predict(X)