Divya Shetti | Portfolio

Case 1: Google Quantum AI

Overview & Problem Statement

Google's Quantum AI initiative aims to harness quantum computing to solve complex problems that classical computers find challenging. The Quantum processors are designed to accelerate tasks in optimization, simulation, and machine learning using advanced quantum algorithms.

The primary goal is to enable quantum computers to outperform classical systems in specific problem domains, such as:

Sampling problems that are computationally intensive
Optimization tasks with large solution spaces
Quantum chemistry simulations requiring high accuracy

Challenges

Building stable, error-corrected qubits
Minimizing decoherence and noise
Scaling qubit counts while keeping error rates in check
Developing algorithms for NISQ (Noisy Intermediate-Scale Quantum) devices

Existing Design & Algorithms

Processor: Sycamore (54-qubit superconducting chip)
Algorithms:
- Quantum Supremacy (random circuit sampling)
- QAOA – Quantum Approximate Optimization Algorithm
- VQE – Variational Quantum Eigensolver

Complexity Overview

Algorithm	Time Complexity	Qubit Requirement	Use Case
Random Circuit Sampling	Exponential (classical)	50+	Quantum supremacy
QAOA	Problem-dependent	Scalable	Optimization
VQE	Ansatz-specific	~20–50	Quantum chemistry

Real-time Usage & Figures

2019: Sycamore solved a problem in 200 seconds vs. 10,000 years on a classical supercomputer
Applications in quantum chemistry, cryptography, and ML prototypes
NISQ devices currently support ~50–100 noisy qubits

Other Applicable Algorithms

Quantum Phase Estimation
Grover’s Search
Quantum Fourier Transform
Quantum Machine Learning: QSVM, QNN

Benefits & Business Impact

Exponential speedups for specific computational tasks
Breakthroughs in optimization, simulation, and logistics
Revolutionizing drug discovery via quantum chemistry simulation
Enabling quantum-native AI systems beyond classical capabilities
Advancing quantum hardware/software co-design innovation
Unlocking new efficiencies in materials science and secure communication

Research & Insights

Though the exact proprietary algorithms and code are not publicly available, Google has published papers on quantum supremacy and related algorithms like the Variational Quantum Eigensolver (VQE).

My Thoughts

This use case demonstrates cutting-edge innovation, and while the underlying technology is complex, its potential impact on industries is vast. Understanding and adapting such advances could position future tech solutions on the frontier of AI and computing. An interesting coding approach I found valuable is dynamically generating qubits,this enables scalable, cleaner circuits and supports adaptable quantum algorithm design more effectively than static declarations.

Technology Stack

Languages: Python (Cirq)
Hardware: Superconducting Qubits, Surface Code
Research Paper: Nature, 2024

Applications

Drug discovery through quantum simulation
Secure communications via quantum cryptography
Accelerating AI models using quantum machine learning

Architecture Insight

References

Case 2: Advancements in Gemini AI

Overview & Problem Statement

Gemini AI represents a next-generation artificial intelligence system integrating large language models with advanced reasoning and multimodal capabilities to deliver highly contextual, accurate, and creative AI responses.

The core goal is to push AI beyond current limits by enabling:

Multimodal understanding combining text, images, and other data streams
Improved contextual reasoning for complex problem-solving
Adaptive learning that customizes interactions dynamically

Challenges

Balancing model size with efficiency and latency
Ensuring robust understanding across diverse inputs
Maintaining ethical considerations and bias mitigation
Scaling multimodal models while preserving accuracy

Existing Design & Algorithms

Architecture: Transformer-based large language and vision models
Key Components:
- Multimodal encoders integrating language and vision data
- Few-shot and zero-shot learning algorithms
- Reinforcement learning with human feedback (RLHF)

Complexity Overview

Model	Compute Complexity	Data Requirements	Use Case
Gemini Core Transformer	O(n²) with respect to input length	Massive multimodal datasets	General AI reasoning
Vision-Language Fusion	Varies by input size and modality	Image and text paired data	Multimodal understanding
RLHF Optimization	Dependent on feedback iterations	Human-labeled preference data	Response alignment and safety

Real-time Usage & Figures

Deployment across search, chatbots, and content generation platforms
Demonstrated improvements in contextual accuracy and reduced hallucinations
Supports millions of concurrent users worldwide

Other Applicable Algorithms

Attention Mechanisms (Self and Cross Attention)
Contrastive Learning for multimodal alignment
Knowledge Distillation to optimize model size
Federated Learning for privacy-preserving model updates

Benefits & Business Impact

Enhanced AI usability across diverse domains and media
More natural and accurate AI-human interactions
Reduced content creation costs through automation
Improved decision making via AI-powered insights
Innovations in AI safety and ethics integration

Research & Insights

Gemini AI builds on recent advances in large language models and multimodal transformers, incorporating extensive human feedback loops and privacy-aware learning. The project emphasizes robust, ethical AI deployment.

My Thoughts

Gemini AI exemplifies the next frontier of AI, merging multiple data streams into cohesive understanding. The challenge of balancing scale with responsiveness is notable, and I find the integration of adaptive learning and ethical safeguards particularly promising for real-world applications.

Technology Stack

Languages: Python, TensorFlow, JAX
Hardware: TPU clusters, GPU farms
Frameworks: Transformers, RL frameworks
Research Paper: Gemini AI Architecture, 2024

Applications

Enhanced search and recommendation systems
Multimodal chatbots and virtual assistants
Automated content generation across text and images

References

Google in 2025

1. 🚀 Google Beam, Gemini, Meet, Astra

1.1 Google Beam is an AI-driven video communication platform that creates immersive 3D experiences using multiple cameras.
1.2 Gemini enables real-time speech translation in meetings, breaking language barriers.
1.3 Google Meet supports real-time speech translation for English and Spanish.
1.4 Project Astra is an advanced AI assistant that uses your phone's camera for real-world interactions.

2. 🤖 Gemini with improved voice, memory, and control

2.1 Handles object misidentification situations.
2.2 Improved voice, memory, and device control; useful for tasks like accessing manuals.
2.3 Assists with bike-related tasks: repair videos, stock checks, emails, and manuals.

3. 🤖 Project Mariner and Gemini API/SDK

3.1 Advanced AI agent with up to 10 concurrent tasks.
3.2 "Teach and repeat" features; Gemini API to be released soon.
3.3 Agent ecosystem powered by Gemini SDK and Agent Mode.
3.4 Helps users with apartment searches and tour scheduling.
3.5 Adds personal memory features for better app recommendations.

4. 🚀 Gemini 2.5 Models

4.1 Personalized smart replies in Gmail.
4.2 Flash model: better speed, cost-efficiency.
4.3 Deep Think mode in Gemini 2.5 Pro improves performance.
4.4 Strong benchmark performance with safety reviews pending.
4.5 Gemini diffusion for parallel, faster text/code generation.
4.6 Advanced audio + multilingual coding support.
4.7 3D experience web app via AI Studio.
4.8 Sketch-to-code 3D animation generation.
4.9 Chrome-based multimodal animation creation app.

5. 🚀 Gemini AI with Coding Agent Jules

5.1 Jules automates complex coding tasks.
5.2 AI overviews now serve 1.5B+ monthly users.
5.3 Supports long, complex queries in Search.
5.4 AI Mode launched in the US with personalized responses.
5.5 Query fanout technique improves information retrieval.
5.6 Deep search offers expert-level, personalized results.

6. 🚀 Dynamic Search, Experiments & Tools

6.1 AI Search generates personalized data visualizations.
6.2 Streamlines event ticket searches and purchases.
6.3 Launch of "Search Live" using Google Lens.
6.4 AI Try-On for realistic virtual clothing fits.
6.5 Agentic checkout auto-buys when prices drop.
6.6 Gemini Live supports screen/cam share, 45+ languages.
6.7 Imagine 4 boosts image/text editing in Gemini.
6.8 V3 video model: better visuals, physics, audio.
6.9 Flow AI tool simplifies filmmaking with precision.

Company Innovations

Case 1: Google Quantum AI

Overview & Problem Statement

Challenges

Existing Design & Algorithms

Complexity Overview

Real-time Usage & Figures

Other Applicable Algorithms

Benefits & Business Impact

Research & Insights

My Thoughts

Technology Stack

Applications

Architecture Insight

References

Case 2: Advancements in Gemini AI

Overview & Problem Statement

Challenges

Existing Design & Algorithms

Complexity Overview

Real-time Usage & Figures

Other Applicable Algorithms

Benefits & Business Impact

Research & Insights

My Thoughts

Technology Stack

Applications

References

Google in 2025

1. 🚀 Google Beam, Gemini, Meet, Astra

2. 🤖 Gemini with improved voice, memory, and control

3. 🤖 Project Mariner and Gemini API/SDK

4. 🚀 Gemini 2.5 Models

5. 🚀 Gemini AI with Coding Agent Jules

6. 🚀 Dynamic Search, Experiments & Tools