The differences between a Data Scientist and a Data Engineer
In data and business intelligence, the data scientist and data engineer roles are both on the rise. But what do they actually do? How do they work together and manage the relational aspects? And how do they complement each other? Before the era of big data, there were already two similar roles. We can call them the “ancestors” […]
What is overfitting and how to solve it in machine learning?
This article explains the phenomenon of overfitting in data science. It is one of the most recurrent problems in machine learning. We give you some clues to detect it, to overcome it, and to make your predictions with precision. A definition of overfitting You have probably already experienced, in the age of big data and […]
How to Easily Schedule Jobs with Apache Airflow?
This article is intended for both Airflow beginners and veterans and aims to present the fundamental objects of this technology as well as its interfacing with Saagie’s DataOps platform. We are not going to explain to you again how to create a Directed Acyclic Graph (commonly called DAG) or how to plan them. Indeed, there […]
What is MLOps?
The recent excitement around data science and big data has enabled the development of an extremely rich and dynamic ecosystem around the analysis of collected data. Open source tools, which are increasingly easy to use, are enabling many organizations to start analyzing their data. However, the multiplication of data projects and algorithms has also brought […]
What is Deep Learning and how does it work?
First of all, it is important to know the history of Deep Learning, before discovering in two other parts how it works and, finally, its future prospects. Deep Learning is a relatively new term, unlike the deep neural networks it refers to. The theory behind Deep Learning is therefore not recent, and even if new […]
Artificial Intelligence in video games
The first results when searching for “Artificial Intelligence in video games” in Google speak for themselves: Artificial Intelligence (AI) in video games is often unsatisfactory. As a gamer, you are also frequently confronted with situations that lose credibility because of the AI’s behavior. Is this a real problem? Why does AI seem to stagnate […]
What are the keys to launch your data project?
Is it possible to deploy a data project, from scoping to large-scale deployment, in 10 weeks? Let’s take a closer look at the keys to accelerating this type of project, which can sometimes take up to 18 months to generate value. Before starting anything, it is essential to know fairly quickly whether there is a […]
Which technologies for your data projects?
You can easily get lost in the data technology ecosystem. Because the technology offering in data management is very (too?!) rich, many solutions are available to you depending on your needs, data sources, industries, infrastructures, skills, and technological situation. This is why we present a review and advice on how to choose your analysis tools. […]
What is Open Data?
Open data refers to the practice of making public digital data available. The data is accessible online, and anyone can freely consult, share, and reuse it (statistics, measures, maps, opening times). The Genesis of Open Data According to the Sunlight Foundation, an American association in favor of transparency and the provision of data for the […]
Saagie Raises $28 million to Revolutionize Data Project Deployment
The French startup aims to become the leader in DataOps by accelerating international development and doubling staff over the next two years. Paris, June 2, 2020 – Saagie, the French software provider, today announces a $28 million fundraising round with Crédit Mutuel Innovation, alongside NewAlpha Asset Management, Seventure Partners and AG2R LA MONDIALE. Historic […]
What is Hybrid Cloud and Why do Companies Choose it?
In order to host and store their data, in the age of digital transformation, a growing number of companies have turned to cloud computing technology. According to a recent Gartner report, the impact and importance of the cloud for businesses can be compared to that of the Internet. Among its benefits we can mention the […]
How to Manage Machine Learning Deployment?
In this article, you will learn how to deploy Machine Learning in an Agile way to support your data projects. Here are 5 steps to keep in mind when addressing this kind of project. Machine Learning Deployment Should be Managed as a Project When we think about Machine Learning deployment, we often think just about the […]