Google's push into advanced multimodal artificial intelligence began with the launch of Gemini 1.0 in December 2023. The model marked a shift in AI design: it was built from the ground up to process and reason across multiple data types at once, including text, images, audio, video, and code. Unlike earlier models that treated multimodality as an add-on feature, Gemini 1.0 was natively multimodal, trained on diverse datasets to capture the interconnected nature of real-world information. It came in three variants: Gemini Ultra for maximum capability on complex tasks, Gemini Pro for balanced performance in general applications, and Gemini Nano for efficient on-device processing, particularly on mobile devices. This foundational release addressed key limitations of prior large language models, such as restricted context windows and siloed modality handling. Gemini 1.0 powered early integrations across Google products, including the rebranding of Bard to Gemini and enhancements to Search and Workspace tools. It performed strongly on benchmarks of multimodal understanding and long-context reasoning, setting a new standard for how AI could interpret combined inputs, for example by describing images in detail or generating code from visual diagrams.