Data

ML.NET 0.4 is here with support for SymSGD, F#, and word embeddings transform!A

2 min read

After the release of ML.NET 0.1 at //Build 2018, back in May, the Microsoft team announced ML.NET 0.4 last week. The latest release includes features such as Word Embedding Transform, SymSGD Learner, and improvements to F# API and samples for ML.NET.

ML.NET is a cross-platform, open source machine learning framework for .NET developers. Let’s have a quick look at the major features in ML.NET 0.4.

Word Embeddings Transform for Text Scenarios

Word embeddings is a method which allows mapping words to numeric vectors for capturing the meaning of the words. This is used for visualization or model training.

With ML.NET 0.4, the word embedding transform is added to ML.NET allowing you to use pre-trained or existing word embedding models in pipelines. There are several different pretrained models such as GloVe, fastText, and SSWE which are available. Adding this transform method along with the existing transforms help improve the model’s metrics.

SymSGD Learner for Binary Classification

SymSGD is now available in ML.NET 0.4 for binary classification. SymSGD is a parallel SGD algorithm which retains the sequential semantics of SGD. It offers a much better performance based on multithreading. It is fast, scales well on multiple cores, and achieves the same accuracy as sequential SGD.

SymSGD can be applied to any linear learner where update rule is linear like a binary classification or a linear regression.

This is how you can add a SymSGD Binary Classifier learner to the pipeline:

pipeline.Add(new SymSgdBinaryClassifier() { NumberOfThreads = 1});

Even though there is no multithreading enabled in SymSGD at the moment, it can be helpful in cases where you want to try many different learners and limit each of them to a single thread.

Improvements to F# API and samples for ML.NET

ML.NET did not provide support for F# records. With ML.NET 0.4 release, you can use property-based row classes in F#.

To get more coverage, check out the official Microsoft blog.

Read Next

Microsoft Open Sources ML.NET, cross-platform machine learning framework

Create machine learning pipelines using unsupervised AutoML [Tutorial

Top AutoML libraries for building your ML pipelines

 

Natasha Mathur

Tech writer at the Packt Hub. Dreamer, book nerd, lover of scented candles, karaoke, and Gilmore Girls.

Share
Published by
Natasha Mathur
Tags: AI News

Recent Posts

Top life hacks for prepping for your IT certification exam

I remember deciding to pursue my first IT certification, the CompTIA A+. I had signed…

3 years ago

Learn Transformers for Natural Language Processing with Denis Rothman

Key takeaways The transformer architecture has proved to be revolutionary in outperforming the classical RNN…

3 years ago

Learning Essential Linux Commands for Navigating the Shell Effectively

Once we learn how to deploy an Ubuntu server, how to manage users, and how…

3 years ago

Clean Coding in Python with Mariano Anaya

Key-takeaways:   Clean code isn’t just a nice thing to have or a luxury in software projects; it's a necessity. If we…

3 years ago

Exploring Forms in Angular – types, benefits and differences   

While developing a web application, or setting dynamic pages and meta tags we need to deal with…

3 years ago

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Software architecture is one of the most discussed topics in the software industry today, and…

3 years ago