Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google

Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

Google via Google Cloud Skills Boost

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond the video using multimodality with Gemini; building metadata of documents containing text and images, getting all relevant text chunks, and printing citations by using Multimodal Retrieval Augmented Generation (RAG) with Gemini. A skill badge is an exclusive digital badge issued by Google Cloud in recognition of your proficiency with Google Cloud products and services and tests your ability to apply your knowledge in an interactive hands-on environment. Complete this skill badge course and the final assessment challenge lab to receive a skill badge that you can share with your network.

Syllabus

  • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
    • Multimodality with Gemini
    • Using Gemini for Multimodal Retail Recommendations
    • Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
    • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG: Challenge Lab
  • Your Next Steps
    • Course Badge

Reviews

Start your review of Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.