Name	Name	Last commit message	Last commit date
parent directory ..
docs	docs
scripts	scripts
src	src
README.md	README.md
THIRD-PARTY-NOTICES.txt	THIRD-PARTY-NOTICES.txt
eShopDashboardML.sln	eShopDashboardML.sln

eShopDashboardML - Sales forecasting

ML.NET version	API type	Status	App Type	Data type	Scenario	ML Task	Algorithms
v1.0.0	Dynamic API	Up-to-date	ASP.NET Core web app and Console app	SQL Server and .csv files	Sales forecast	Regression	FastTreeTweedie Regression

eShopDashboardML is a web app with Sales Forecast predictions (per product and per country) using Microsoft Machine Learning .NET (ML.NET).

Overview

This end-to-end sample app highlights the usage of ML.NET API by showing the following topics:

How to train, build and generate ML models
- Implemented as a console app using .NET Core.
How to predict the next month of Sales Forecasts by using the trained ML model
- Implemented as a single, monolithic web app using ASP.NET Core Razor.

The app is also using a SQL Server database for regular product catalog and orders info, as many typical web apps using SQL Server. In this case, since it is an example, it is, by default, using a localdb SQL database so there's no need to setup a real SQL Server. The localdb database will be created, along with sample populated data, the first time you run the web app.

If you want to use a real SQL Server or Azure SQL Database, you just need to change the connection string in the app.

When you run the app, it opens the webpage with a serach box says "Type a product". You can type for any product say "bottle". Then a list of products related to keyword "bottle" will show in autocomplete suggestions. Once you select any product then the sales forecast of that product will be shown as below.

Here's a sample screenshot of the web app and one of the forecast predictions:

Walkthroughs on how to set it up

Learn how to set it up in Visual Studio plus further explanations on the code:

Setting up eShopDashboard in Visual Studio and running the web app
Create and Train your ML models
- This step is optional as the web app is already configured to use a pre-trained model. But you can create your own trained model and swap the pre-trained model with your own.

Walkthrough on the implemented ML.NET code

Problem

This problem is centered around country and product forecasting based on previous sales

DataSet

To solve this problem, you build two independent ML models that take the following datasets as input:

Data Set	columns
products stats	next, productId, year, month, units, avg, count, max, min, prev
country stats	next, country, year, month, max, min, std, count, sales, med, prev

Explanation of Dataset - Goto this link for detailed information on dataset.

ML task - Regression

The ML Task for this sample is a Regression, which is a supervised machine learning task that is used to predict the value of the next period (in this case the sales prediction) from a set of related features/variables.

Solution

To solve this problem, first we will build the ML models while training each model on existing data, evaluate how good it is, and finally you consume the model to predict sales.

Note that the sample implements two independent models:

Model to predict product's demand forecast for the next period (month)
Model to predict country's sales forecast for the next period (month)

However, when learning/researching the sample, you can focus just on one of the scenarios/models.

1. Build Model

STEP 1: Define the schema of data in a class type and refer that type while loading data using TextLoader. Here the class type is ProductData.

Schema in a class type

 public class ProductData
    {
        // next,productId,year,month,units,avg,count,max,min,prev
        //The index of column in LoadColumn(int index) should be matched with the position of columns in file.
        [LoadColumn(0)]
        public float next;

        [LoadColumn(1)]
        public string productId;

        [LoadColumn(2)]
        public float year;

        [LoadColumn(3)]
        public float month;

        [LoadColumn(4)]
        public float units;

        [LoadColumn(5)]
        public float avg;

        [LoadColumn(6)]
        public float count;

        [LoadColumn(7)]
        public float max;

        [LoadColumn(8)]
        public float min;

        [LoadColumn(9)]
        public float prev;
    }

Model build and train

Load the dataset into the DataView.


var trainingDataView = mlContext.Data.LoadFromTextFile<ProductData>(dataPath, hasHeader: true, separatorChar:',');

Build the pipeline transformations and to specify what trainer/algorithm you are going to use. In this case you are doing the following transformations:

Concat current features to a new Column named NumFeatures
Transform productId using one-hot encoding
Concat all generated fetures in one column named 'Features'
Copy next column to rename it to "Label"
Specify the "Fast Tree Tweedie" Trainer as the algorithm to apply to the model

You can load the dataset either before or after designing the pipeline. Although this step is just configuration, it is lazy and won't be loaded until training the model in the next step.

var trainer = mlContext.Regression.Trainers.FastTreeTweedie("Label", "Features");

var trainingPipeline = mlContext.Transforms.Concatenate(outputColumnName: "NumFeatures", nameof(CountryData.year),
                                nameof(CountryData.month), nameof(CountryData.max), nameof(CountryData.min),
                                nameof(CountryData.std), nameof(CountryData.count), nameof(CountryData.sales),
                                nameof(CountryData.med), nameof(CountryData.prev))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding(outputColumnName: "CatFeatures", inputColumnName: nameof(CountryData.country)))
                    .Append(mlContext.Transforms.Concatenate(outputColumnName: "Features", "NumFeatures", "CatFeatures"))
                    .Append(mlContext.Transforms.CopyColumns(outputColumnName: "Label", inputColumnName: nameof(CountryData.next)))
                    .Append(trainer);

2. Evaluate model with cross-validation

In this case, the evaluation of the model is performed before training the model with a cross-validation approach, so you obtain metrics telling you how good is the accuracy of the model.

var crossValidationResults = mlContext.Regression.CrossValidate(data:trainingDataView, estimator:trainingPipeline, numberOfFolds: 6, labelColumnName: "Label");
            
ConsoleHelper.PrintRegressionFoldsAverageMetrics(trainer.ToString(), crossValidationResults);

3. Train model

After building the pipeline, we train the forecast model by fitting or using the training data with the selected algorithm. In that step, the model is built, trained and returned as an object:

var model = trainingPipeline.Fit(trainingDataView);

4. Save the model for later comsumption from end-user apps

Once the model is created and evaluated, you can save it into a .ZIP file which could be consumed by any end-user application with the following code:

using (var file = File.OpenWrite(outputModelPath))
                mlContext.Model.Save(model, trainingDataView.Schema, file);

5. Try the model with a simple test prediction

Basically, you can load the model from the .ZIP file create some sample data, create the "prediction function" and finally you make a prediction.

ITransformer trainedModel;
using (var stream = File.OpenRead(outputModelPath))
{
    trainedModel = mlContext.Model.Load(stream,out var modelInputSchema);
}

var predictionEngine = mlContext.Model.CreatePredictionEngine<CountryData, CountrySalesPrediction>(trainedModel);

Console.WriteLine("** Testing Product 1 **");

// Build sample data
ProductData dataSample = new ProductData()
{
    productId = "263",
    month = 10,
    year = 2017,
    avg = 91,
    max = 370,
    min = 1,
    count = 10,
    prev = 1675,
    units = 910
};

// Predict the nextperiod/month forecast to the one provided
ProductUnitPrediction prediction = predictionEngine.Predict(dataSample);
Console.WriteLine($"Product: {dataSample.productId}, month: {dataSample.month + 1}, year: {dataSample.year} - Real value (units): 551, Forecast Prediction (units): {prediction.Score}");

Citation

eShopDashboardML dataset is based on a public Online Retail Dataset from UCI: http://archive.ics.uci.edu/ml/datasets/online+retail

Daqing Chen, Sai Liang Sain, and Kun Guo, Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining, Journal of Database Marketing and Customer Strategy Management, Vol. 19, No. 3, pp. 197â€“208, 2012 (Published online before print: 27 August 2012. doi: 10.1057/dbm.2012.17).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regression-SalesForecast

Regression-SalesForecast

README.md

eShopDashboardML - Sales forecasting

Overview

Walkthroughs on how to set it up

Walkthrough on the implemented ML.NET code

Problem

DataSet

ML task - Regression

Solution

1. Build Model

2. Evaluate model with cross-validation

3. Train model

4. Save the model for later comsumption from end-user apps

5. Try the model with a simple test prediction

Citation

Files

Regression-SalesForecast

Directory actions

More options

Directory actions

More options

Latest commit

History

Regression-SalesForecast

Folders and files

parent directory

README.md

eShopDashboardML - Sales forecasting

Overview

Walkthroughs on how to set it up

Walkthrough on the implemented ML.NET code

Problem

DataSet

ML task - Regression

Solution

1. Build Model

2. Evaluate model with cross-validation

3. Train model

4. Save the model for later comsumption from end-user apps

5. Try the model with a simple test prediction

Citation