Note4Students
From UPSC perspective, the following things are important :
Prelims level: Google Gemini
Mains level: Recent breakthrough in AI
Central Idea
- Google has introduced Gemini, a new multimodal general AI model, available globally through Bard.
- It is seen as Google’s response to ChatGPT, offering advanced capabilities in the realm of GenAI.
What is Google Gemini?
- Unlike ChatGPT, Gemini can process and operate across various formats including text, code, audio, image, and video.
- Google claims Gemini Ultra surpasses current models in academic benchmarks and is the first to outperform human experts in massive multitask language understanding (MMLU).
Different versions available
- Three Variants: Gemini comes in three sizes – Ultra, Pro, and Nano – each designed for specific levels of complexity and tasks.
- Gemini Ultra: Intended for highly complex tasks, currently in a trial phase with select users.
- Gemini Pro: Available in Bard for general users, offering advanced reasoning and understanding, and accessible to developers via Google AI Studio or Google Cloud Vertex AI.
- Gemini Nano: Focused on on-device tasks, already integrated into Pixel 8 Pro, and soon available to Android developers via AICore in Android 14.
Addressing Challenges of Hallucinations and Safety
- Factuality and Hallucinations: While improvements have been made, Gemini, like other LLMs, is still prone to hallucinations. Google uses additional techniques in Bard to enhance response accuracy.
- Safety Measures: Google emphasizes new protections for Gemini’s multimodal capabilities, conducting comprehensive safety evaluations, including bias and toxicity assessments.
- Ongoing Safety Research: Google collaborates with external experts to stress-test models and identify potential risks in areas like cyber-offence and persuasion.
Hallucination: Asking a generative AI application for five examples of bicycle models that will fit in the back of your specific make of sport utility vehicle. If only three models exist, the GenAI application may still provide five — two of which are entirely fabricated. |
Comparing Gemini and ChatGPT 4
- Flexibility and Capabilities: Gemini appears more versatile than GPT4, especially with its video processing and offline functionality.
- Accessibility and Cost: Unlike the paid-access ChatGPT4, Gemini is currently free to use, potentially giving it a broader user base.
Get an IAS/IPS ranker as your 1: 1 personal mentor for UPSC 2024