Elena Canorea
Communications Lead
Introduction
Microsoft launched Fabric, its all-in-one data and analytics platform, a few months ago, covering everything from data movement and ETL processes to data science, real-time analytics, business intelligence, and visualization.
Strong data governance is essential to a well-functioning enterprise and OneLake offers a unified and cohesive datalake solution for all Fabric experiences. We take a look at all its benefits, use cases, and key features.
OneLake is a unified data lake platform that centralizes all of a company’s data in one place (similar to how OneDrive centralizes documents). This centralized infrastructure facilitates cross-team collaboration, eliminates information silos, and improves operational efficiency by avoiding the dispersion of data across multiple systems.
Its intuitive interface and advanced capabilities make Microsoft OneLake an essential tool for those looking to optimize the management of their digital assets.
Unlike other products such as OneDrive (designed for personal and team file and document storage), OneLake specializes in enterprise-scale data management and analysis. This includes advanced capabilities for processing large volumes of data and support for Machine Learning operations and predictive analytics. In addition, OneLake is built on Azure Data Lake infrastructure, which ensures high compatibility and performance with other Azure tools dedicated to analytics and AI.
OneLake has emerged as the solution to the challenges of the previous numerous lakes for different business groups or the overhead of managing multiple resources, improving collaboration.
Knowing where a client’s collaboration begins and ends provides a natural boundary of governance and compliance, which is under the control of a tenant manager.
Within a tenant, OneLake can create any number of workspaces, which allow different parts of the organization to distribute ownership and access policies.
Within a workspace, data elements can be created and all data in OneLake can be accessed through them. Fabric stores lakehouses, warehouses, and other elements in OneLake.
OneLake is built on Azure Data Lake Storage Gen2 and can support any file type, structured or unstructured. All Fabric data elements, such as data warehouses and lakehouses, automatically store their data in OneLake in Delta Parquet format.
OneLake supports the same ADLS Gen2 APIs and SDKs to be compatible with existing ADLS Gen2 applications, including Azure Databricks. You can approach data in OneLake as one large ADLS storage account for the entire organization. Each workspace appears as a container within that storage account, and the different data items appear as folders within those containers.
Allows you to easily browse OneLake data from Windows using the OneLake file explorer. This makes navigating through all workspaces and data elements easy, and easily upload, download, or modify files.
This explorer simplifies working with data lakes, allowing even non-technical business users to use it.
OneLake provides more value from a single copy of data with no data movement or duplication. With this tool, you no longer need to copy data only for use with another engine or to eliminate silos for analysis with data from other sources.
In addition, shortcuts make it easy to share data between users and applications without having to move or duplicate information unnecessarily. When teams work independently in separate workspaces, these shortcuts allow them to combine data from different business groups and domains into a virtual data product to meet a user’s specific needs.
In fact, with Fabric, the different analytics engines store data in Delta Parquet’s open format to allow the same data to be used across multiple engines. Business users can create Power BI reports directly on OneLake using the new Direct Lake mode in the Analysis Services engine. The Analysis Services engine is what drives Power BI semantic models and has always offered two ways to access data: import and direct query.
Microsoft OneLake offers a great scalable storage solution, as well as security or integration with other Microsoft tools, but brings many more benefits:
As the heart of Microsoft Fabric and the foundational data layer, OneLake offers a unified and cohesive datalake solution for all Fabric experiences. Designed as SaaS, the solution combines different areas into an integrated analytics system with OneLake as the foundational data layer.
A company’s data is often distributed or spread across different areas or departments, yet they may need access to the same or similar data sources. If data sources are copied across domains, it may not be clear who owns the initial source, which can lead to a lack of trust in data products and teams.
The new updates bring value to Fabric and OneLake technology in several ways:
By leveraging these two capabilities, organizations can effortlessly establish virtualized lakes, breaking down silos between various entities, eliminating the waiting period for IT to establish and maintain pipelines, and reducing resource allocation and time for data migration.
By acting as a unified data lake, with shortcuts, data can be referenced in different locations while still being logically represented within the same lake.
These shortcuts allow links to be created that point to a particular storage location. This storage location can be internal or external.
On the other hand, an important aspect of reducing output costs is the caching of shortcuts between clouds. These shortcuts can be created from lakehouses or KQL databases and managed programmatically through designated APIs.
This feature is a major step towards distributed data ownership in OneLake, helping to unify the data landscape and a breakthrough for the Fabric community.
Another feature is database replication. This offers the ability to maintain data synchronization between two databases without the need to create complex ETL pipelines. This allows you to replicate a database or a subset of database tables in Fabric’s OneLake, keeping it up to date in near real-time. In addition, there are no further storage costs associated with mirroring, as this feature is implemented by reading transaction log changes from the source database and updating the relevant data.
One of the main advantages is that it protects operational databases from analytical queries. With many join and grouping operations, they can be quite demanding for operational DBs, and Mirroring helps reduce the workload. Another advantage is the ability to cross-query between databases directly in Fabric. In addition, it supports table schema evolution and supported data type changes.
However, the most significant advantage is the potential for real-time analytics combined with reduced time to value and the ability to quickly create powerful data products. Power BI can access mirrored database tables with Direct Lake mode, providing the ability to easily create dashboards and reports for near real-time monitoring.
In a rapidly changing business environment, the ability to analyze data instantly has become a necessity, and thanks to it, companies gain the ability to monitor events in real-time.
This allows you to react quickly to changes and solve potential problems. And at Plain Concepts we help you get the most out of it.
We offer you a Microsoft Fabric Adoption Framework with which we will evaluate the technological and business solutions, we will make a clear roadmap for the data strategy, we visualize the use cases that make the difference in your company, we take into account the sizing of equipment, time and costs, we study the compatibility with existing data platforms and we migrate Power BI, Synapse and Datawarehouse solutions to Fabric.
Elena Canorea
Communications Lead
Cookie | Duration | Description |
---|---|---|
__cfduid | 1 year | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 29 days 23 hours 59 minutes | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 1 year | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 29 days 23 hours 59 minutes | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
attributionCookie | session | No description |
cookielawinfo-checkbox-analytics | 1 year | Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category . |
cookielawinfo-checkbox-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-performance | 1 year | Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance". |
cppro-ft | 1 year | No description |
cppro-ft | 7 years 1 months 12 days 23 hours 59 minutes | No description |
cppro-ft | 7 years 1 months 12 days 23 hours 59 minutes | No description |
cppro-ft | 1 year | No description |
cppro-ft-style | 1 year | No description |
cppro-ft-style | 1 year | No description |
cppro-ft-style | session | No description |
cppro-ft-style | session | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 1 year | No description |
i18n | 10 years | No description available. |
IE-jwt | 62 years 6 months 9 days 9 hours | No description |
IE-LANG_CODE | 62 years 6 months 9 days 9 hours | No description |
IE-set_country | 62 years 6 months 9 days 9 hours | No description |
JSESSIONID | session | The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application. |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 1 year | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 1 year | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
VISITOR_INFO1_LIVE | 5 months 27 days | A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface. |
wmc | 9 years 11 months 30 days 11 hours 59 minutes | No description |
Cookie | Duration | Description |
---|---|---|
__cf_bm | 30 minutes | This cookie, set by Cloudflare, is used to support Cloudflare Bot Management. |
sp_landing | 1 day | The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content. |
sp_t | 1 year | The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content. |
Cookie | Duration | Description |
---|---|---|
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 1 year | No description |
_hjid | 11 months 29 days 23 hours 59 minutes | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 11 months 29 days 23 hours 59 minutes | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 1 year | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 1 year | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjSession_1776154 | session | No description |
_hjSessionUser_1776154 | session | No description |
_hjTLDTest | 1 year | No description |
_hjTLDTest | 1 year | No description |
_hjTLDTest | session | No description |
_hjTLDTest | session | No description |
_lfa_test_cookie_stored | past | No description |
Cookie | Duration | Description |
---|---|---|
loglevel | never | No description available. |
prism_90878714 | 1 month | No description |
redirectFacebook | 2 minutes | No description |
YSC | session | YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages. |
yt-remote-connected-devices | never | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt-remote-device-id | never | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt.innertube::nextId | never | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |
yt.innertube::requests | never | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |