How to use Foundation Models in Azure Machine Learning??

In recent years we have seen how advances in artificial intelligence and Machine Learning have led to the emergence of large Foundation Models that are pre-trained with a large amount of data.

We analyze what they consist of, their advantages when using them in Azure Machine Learning, and how to use them.

What are Foundation Models in Azure Machine Learning?

Foundation Models are a starting point for developing specialized models, which can be easily adapted to multiple applications in different industries. In fact, these models have positioned themselves as a unique opportunity for companies to create and use in their Deep Learning workloads.

Using them in Azure Machine Learning provides native Azure ML capabilities that allow these open-source models to be deployed at scale. They can be easily integrated into business applications and include capabilities such as:

Discover: this allows you to review model descriptions, test sample inference, and search for code samples to evaluate, tune, or implement the model.
Evaluate: This allows you to check if the model suits your specific workload by providing your own test data. This makes it easy to visualize the selected model.
Finetune: allows you to organize your training work and find the model that best suits your needs.
Deploy: you can deploy pre-trained base models or fitted models on online endpoints for real-time inference or to process large datasets.
Import: you can use the latest models by importing models similar to those in the catalog.

Catalog of models and collections

This is a hub for finding Foundation Models in Azure Machine Learning and is a starting point for exploring these models. You will be able to search and filter models according to the tasks they are capable of. For now, only models work with text, but whispers that can work with audio have also been deployed.

This catalog has, at the moment, two collections of models: Open source models selected by Azure Machine Learning (ready for immediate use and optimized, natively supported, and easily migrated) and Transformers models from the HuggingFace center (thousands of models for real-time inference with online endpoints).

The latter service is the creator of the leading open-source library for creating state-of-the-art ML models. It allows you to deploy machine learning models on a dedicated connection point to Azure’s enterprise-grade infrastructure. It allows you to choose from tens of thousands of ML models for natural language processing, audio, and machine vision to accelerate your workload. It also streamlines inference with easy deployment and helps keep our data private and secure.

How to use the Foundation Models selected by Azure Machine Learning?

As mentioned above, the Foundation Models in Azure Machine Learning provide native functionality for discovering, evaluating, tuning, deploying, and running these open-source models.

To access these models, you’ll need to go to the Azure Machine Learning Studio, a hub for discovering the Foundation Models catalog. You’ll see the most popular models, open-source LLMs, and more tasks coming soon.

You will have the option to filter by task or license and then select a specific model name, where you can read a card describing the details of the model:

Task: indicates the inference task for which this pre-trained model can be used.
Finetuning tasks: lists the tasks for which this model can be adjusted.
License: indicates the license information.

Thanks to the model card, you can quickly test any model using the sample inference widget, which will give you your own sample input to test the result.

How to evaluate Foundation Models using your own test data?

You can evaluate a model against your test data set in two ways: via the “Evaluate UI Wizard” or code-based examples.

In the UI Wizard evaluation, each model can be evaluated for a specific inference task:

Test data: Pass the test data you want to evaluate by uploading a local file or selecting a set of data recorded in your workspace. Once selected, assign the input data columns according to the schema you need for each task.
Compute: Provide the Azure ML cluster with what you want to use to tune the model (must run on CPU compute and with sufficient compute quota). Select “Finish” in the evaluation wizard. Once the work is complete, you can view the model metrics and decide if you want to tune the model using your own training data.

Advanced evaluation parameters: Besides the basic evaluation, the wizard includes several advanced evaluation parameters, including default values that can be customized through code-based samples.

¿ How to fit models with your own training data?

To improve the model’s performance in your workload, you can make adjustments using your own training data easily using the Finetune wizard or code-based examples linked from the model card.

Each pre-trained model in the catalog can be adjusted for a specific set of tasks; just select it from the drop-down menu. Pass the training data by uploading a local file or selecting a dataset from your workspace.

Then pass the data to validate by selecting “Automatic split.” Also, pass any test data you want to use to evaluate the fitted model. An automatic split of the training data will be reserved for testing.

Next, provide the cluster of the process you want to tune, where we recommend using GPU A100/V100 compute SKUs. Finally, select “Finish” in the wizard to submit your fine-tuning job.

You will find several advanced tuning parameters, such as learning rate, epochs, batch size, etc.

Machine Learning Models

At Plain Concepts, we help companies manage their Machine Learning projects by providing expert guidance on AI and MLOps, including assessing current capabilities and applying industry standard practices to maintain a production-ready ML environment.

We are one of the first companies to earn the AI and Machine Learning on Microsoft Azure Advanced Specialization, so we can assist you in implementing solutions for the lifecycle of machine learning and AI-driven applications.

If you are ready to start or advance your project but don’t know how we can help. Contact us, and our experts will study your case to find a way to get the most out of your business.

Cookie	Duration	Description
__cfduid	1 year	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
__cfduid	29 days 23 hours 59 minutes	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
__cfduid	1 year	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
__cfduid	29 days 23 hours 59 minutes	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
_ga	1 year	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_ga	1 year	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_ga	1 year	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_ga	1 year	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gat_UA-326213-2	1 year	No description
_gat_UA-326213-2	1 year	No description
_gat_UA-326213-2	1 year	No description
_gat_UA-326213-2	1 year	No description
_gid	1 year	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.
_gid	1 year	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.
_gid	1 year	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.
_gid	1 year	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.
attributionCookie	session	No description
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-non-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary".
cookielawinfo-checkbox-non-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary".
cookielawinfo-checkbox-non-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary".
cookielawinfo-checkbox-non-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
cppro-ft	1 year	No description
cppro-ft	7 years 1 months 12 days 23 hours 59 minutes	No description
cppro-ft	7 years 1 months 12 days 23 hours 59 minutes	No description
cppro-ft	1 year	No description
cppro-ft-style	1 year	No description
cppro-ft-style	1 year	No description
cppro-ft-style	session	No description
cppro-ft-style	session	No description
cppro-ft-style-temp	23 hours 59 minutes	No description
cppro-ft-style-temp	23 hours 59 minutes	No description
cppro-ft-style-temp	23 hours 59 minutes	No description
cppro-ft-style-temp	1 year	No description
i18n	10 years	No description available.
IE-jwt	62 years 6 months 9 days 9 hours	No description
IE-LANG_CODE	62 years 6 months 9 days 9 hours	No description
IE-set_country	62 years 6 months 9 days 9 hours	No description
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
wmc	9 years 11 months 30 days 11 hours 59 minutes	No description

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.

Cookie	Duration	Description
_hjAbsoluteSessionInProgress	1 year	No description
_hjAbsoluteSessionInProgress	1 year	No description
_hjAbsoluteSessionInProgress	1 year	No description
_hjAbsoluteSessionInProgress	1 year	No description
_hjFirstSeen	29 minutes	No description
_hjFirstSeen	29 minutes	No description
_hjFirstSeen	29 minutes	No description
_hjFirstSeen	1 year	No description
_hjid	11 months 29 days 23 hours 59 minutes	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjid	11 months 29 days 23 hours 59 minutes	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjid	1 year	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjid	1 year	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjIncludedInPageviewSample	1 year	No description
_hjIncludedInPageviewSample	1 year	No description
_hjIncludedInPageviewSample	1 year	No description
_hjIncludedInPageviewSample	1 year	No description
_hjSession_1776154	session	No description
_hjSessionUser_1776154	session	No description
_hjTLDTest	1 year	No description
_hjTLDTest	1 year	No description
_hjTLDTest	session	No description
_hjTLDTest	session	No description
_lfa_test_cookie_stored	past	No description

Cookie	Duration	Description
loglevel	never	No description available.
prism_90878714	1 month	No description
redirectFacebook	2 minutes	No description
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.