arrow_back

Build an AI Image Recognition app using Gemini on Vertex AI

Accedi Partecipa
Accedi a oltre 700 lab e corsi

Build an AI Image Recognition app using Gemini on Vertex AI

Lab 15 minuti universal_currency_alt Nessun costo show_chart Introduttivi
info Questo lab potrebbe incorporare strumenti di AI a supporto del tuo apprendimento.
Accedi a oltre 700 lab e corsi

Google Cloud self-paced labs logo

Overview

  • Labs are timed and cannot be paused. The timer starts when you click Start Lab.
  • The included cloud terminal is preconfigured with the gcloud SDK.
  • Use the terminal to execute commands and then click Check my progress to verify your work.

Objective

Generative AI on Vertex AI (also known as genAI or gen AI) gives you access to Google's large generative AI models so you can test, tune, and deploy them for use in your AI-powered applications. In this lab, you will:

  • Connect to Vertex AI (Google Cloud AI platform): Learn how to establish a connection to Google's AI services using the Vertex AI SDK.
  • Load a pre-trained generative AI model -Gemini: Discover how to use a powerful, pre-trained AI model without building one from scratch.
  • Send image + text questions to the AI model: Understand how to provide input for the AI to process.
  • Extract text-based answers from the AI: Learn to handle and interpret the text responses generated by the AI model.
  • Understand the basics of building AI applications: Gain insights into the core concepts of integrating AI into software projects.

Working with Vertex AI Python SDK

After starting the lab, you will get a split pane view consisting of the Code Editor on the left side and the lab instructions on the right side. Follow these steps to interact with the Generative AI APIs using Vertex AI Python SDK.

  1. Click File > New File to open a new file within the Code Editor.
  2. Copy and paste the provided code snippet into your file.
from google import genai from google.genai.types import HttpOptions, Part client = genai.Client(http_options=HttpOptions(api_version="v1")) response = client.models.generate_content( model="gemini-2.0-flash-001", contents=[ "What is shown in this image?", Part.from_uri( file_uri="gs://cloud-samples-data/generative-ai/image/scones.jpg", mime_type="image/jpeg", ), ], ) print(response.text)
  1. Click File > Save, enter genai.py for the Name field and click Save.

  2. To set the environment variables in the new terminal, run the following command:

    export GOOGLE_CLOUD_PROJECT='{{{ project_0.project_id | "project-id" }}}' export GOOGLE_CLOUD_LOCATION='{{{ project_0.default_region | "REGION" }}}' export GOOGLE_GENAI_USE_VERTEXAI=True
  3. Execute the Python file by invoking the below command inside the terminal within the Code Editor pane to view the output.

/usr/bin/python3 /genai.py Note: If you encounter a 400 error, try re-running the code.

Code Explanation

  • The code snippet is loading a pre-trained AI model called Gemini (gemini-2.0-flash-001) on Vertex AI.
  • The code calls the generate_content method of the loaded Gemini model.
  • The input to the method is an image URI and a prompt containing a question about the image.
  • The code uses Gemini's ability to understand images and text together. It uses the text provided in the prompt to describe the contents of the image.

Try it yourself! Experiment with different image URIs and prompt questions to explore Gemini's capabilities.

Click Check my progress to verify the objective.

Generate content for the image

Congratulations!

You have completed the lab! Congratulations!!

Copyright 2025 Google LLC. All rights reserved. Google and the Google logo are trademarks of Google LLC. All other company and product names may be trademarks of the respective companies with which they are associated.

Prima di iniziare

  1. I lab creano un progetto e risorse Google Cloud per un periodo di tempo prestabilito
  2. I lab hanno un limite di tempo e non possono essere messi in pausa. Se termini il lab, dovrai ricominciare dall'inizio.
  3. In alto a sinistra dello schermo, fai clic su Inizia il lab per iniziare

Utilizza la navigazione privata

  1. Copia il nome utente e la password forniti per il lab
  2. Fai clic su Apri console in modalità privata

Accedi alla console

  1. Accedi utilizzando le tue credenziali del lab. L'utilizzo di altre credenziali potrebbe causare errori oppure l'addebito di costi.
  2. Accetta i termini e salta la pagina di ripristino delle risorse
  3. Non fare clic su Termina lab a meno che tu non abbia terminato il lab o non voglia riavviarlo, perché il tuo lavoro verrà eliminato e il progetto verrà rimosso

Questi contenuti non sono al momento disponibili

Ti invieremo una notifica via email quando sarà disponibile

Bene.

Ti contatteremo via email non appena sarà disponibile

Un lab alla volta

Conferma per terminare tutti i lab esistenti e iniziare questo

Utilizza la navigazione privata per eseguire il lab

Utilizza una finestra del browser in incognito o privata per eseguire questo lab. In questo modo eviterai eventuali conflitti tra il tuo account personale e l'account Studente, che potrebbero causare addebiti aggiuntivi sul tuo account personale.