Google Launches Free Gemini-Powered Data Science Agent for Colab

March 4, 2025

3 minutes

🟢easy Reading Level

Google's Data Science Agent, powered by Gemini 2.0, is now freely available on Colab, designed to simplify data analysis by automating the creation of fully functional Jupyter notebooks.

In this blog post, we'll explore the key features and capabilities of the Data Science Agent, and how it can be used to accelerate data analysis in Google Colab.

What Is Data Science Agent?

The Data Science Agent is an AI-powered assistant that takes your text prompt of an analysis goal and generates a complete Colab notebook. It handles tasks such as data loading, library imports, preliminary exploratory analysis, and even code for visualizing results. Powered by Gemini 2.0, this tool builds upon Google Colab's eight-year legacy of providing cloud-based Jupyter Notebook environments for running Python code on Google's GPUs and TPUs.

What's New Inside?

This agent represents a leap forward in automating repetitive and time-consuming tasks:

  • Automated notebook generation: Instead of manually setting up your environment and writing boilerplate code, you simply upload your dataset and describe your objectives.
  • End-to-end workflow support: It goes beyond mere code snippets to create notebooks that are fully executable, saving precious time in the early stages of analysis.
  • Modifiability: The notebooks are generated with clarity and structure in mind, allowing you to easily customize and extend the code for deeper analysis.

The agent has demonstrated impressive performance in industry benchmarks, ranking 4th on the DABStep: Data Agent Benchmark for Multi-step Reasoning on Hugging Face, ahead of several notable AI agents including ReAct (GPT-4.0), Deepseek, Claude 3.5 Haiku, and Llama 3.3 70B.

What Does Data Science Agent Allow Users to Do?

By using the Data Science Agent, data scientists and researchers can focus on insights rather than spending hours on initial setup and coding and eliminate common pitfalls in early-stage data processing and exploratory data analysis. Such agentic tools also lower the barrier to entry by transforming complex tasks into simple, natural language prompts.

The tool supports various types of analysis, including:

  • Visualizing trends and patterns in data
  • Training prediction models
  • Cleaning and handling missing values
  • Calculating and visualizing various types of correlations
  • Performing classification tasks

Who Can Use It and How?

The Data Science Agent is currently available to Colab users age 18+ and in select countries and languages.

To get started, users can:

  1. Open a new Colab notebook
  2. Upload their dataset (CSV, JSON, etc.)
  3. Describe their analysis goals using natural language in the Gemini side panel
  4. Execute the generated notebook to see insights and visualizations

While Google Colab offers a free tier, users requiring additional computing power can choose from several paid options:

  • Colab Pro ($9.99/month): Includes 100 compute units, faster GPUs, more memory, and terminal access
  • Colab Pro+ ($49.99/month): Offers 500 compute units, priority GPU access, and background execution
  • Pay-as-you-go: Options for 100 compute units ($9.99) or 500 compute units ($49.99)
  • Colab Enterprise: Features Google Cloud integration and AI-powered code generation

The Data Science Agent has already shown promising results in real-world applications. For instance, scientists at Lawrence Berkeley National Laboratory reported reducing their data processing time from one week to just five minutes when analyzing tropical wetland methane emissions.

Conclusion

The Data Science Agent is a promising step toward streamlining repetitive tasks in data analysis, allowing you to concentrate on what truly matters: extracting insights and driving decisions. As feedback is integrated and the technology matures, it's poised to become an essential tool for data professionals worldwide.

Valeriia Kuka

Valeriia Kuka, Head of Content at Learn Prompting, is passionate about making AI and ML accessible. Valeriia previously grew a 60K+ follower AI-focused social media account, earning reposts from Stanford NLP, Amazon Research, Hugging Face, and AI researchers. She has also worked with AI/ML newsletters and global communities with 100K+ members and authored clear and concise explainers and historical articles.


© 2025 Learn Prompting. All rights reserved.