A Journey Through Personal Albums and Exploring the Intersection of Tech and Humanity
Welcome to my personal blog that delves into the intricate tapestry of personal albums and the fascinating intersection of ever-evolving technology and humanity. Come along on a journey with me as we delve into the seamless fusion of creativity, state-of-the-art AI and robotics, intricately interwoven within the tapestry of our shared awareness.
Hover your mouse over the image and see the AI genereated caption and rating. Have fun!
Nvidia Enhances ChatRTX with Google Gemma AI Models
Nvidia, the renowned graphics card manufacturer, is taking a significant step forward in the realm of AI-powered chatbots with its latest update to ChatRTX. This experimental chatbot, designed to run locally on Windows PCs equipped with RTX GPUs, is now expanding its capabilities by integrating additional AI models and introducing voice query functionality.
ChatRTX or Chat with RTX
ChatRTX, also known as "Chat with RTX," allowed users to leverage Mistral or Llama 2 models to query personal documents fed into the system. However, with the recent update, Nvidia is broadening the horizons of ChatRTX by incorporating Google's Gemma, ChatGLM3, and even OpenAI's CLIP model. These additions aim to enhance the chatbot's ability to search through various types of data, including photos.
ChatRTX uses retrieval-augmented generation, NVIDIA TensorRT-LLM software and NVIDIA RTX acceleration. See below the youtube video for more details:
To run ChatRTX, users will need a compatible Nvidia GPU, specifically an RTX 30 or 40 Series GPU or RTX Ampere or Ada Generation GPU with at least 8GB of VRAM. The app streamlines the process of creating a local chatbot server, accessible through a web browser, where users can input their local documents and even YouTube videos. This powerful search tool generates summaries and provides answers to questions based on the user's own data.
The inclusion of Google's Gemma model in ChatRTX is particularly noteworthy, as it was designed to run directly on high-performance laptops or desktop computers. By integrating Gemma, Nvidia simplifies the complexity of running these models locally, offering users a user-friendly chatbot interface that allows them to select the most suitable model for their specific data analysis or search requirements.
Download ChatRTX
ChatRTX, available as a 12.5GB download from Nvidia's website, now also supports ChatGLM3, an open bilingual large language model that handles both English and Chinese. This addition expands the chatbot's linguistic capabilities, catering to a wider user base. Furthermore, the integration of OpenAI's Contrastive Language–Image Pre-training (CLIP) enables users to search and interact with local photo data, effectively training the model to recognize and understand images.
In a bid to enhance user interaction, Nvidia has also updated ChatRTX to support voice queries. By integrating Whisper, an AI speech recognition system, users can now search their data using voice commands. This feature adds a new dimension of convenience and accessibility to the chatbot, allowing users to retrieve information hands-free.
Google Gemma Models
Gemma is a family of lightweight, state-of-the-art open models available in two sizes: Gemma 2B and Gemma 7B. Each size comes with pre-trained and instruction-tuned variants. Gemma models achieve best-in-class performance for their sizes compared to other open models while adhering to Google's rigorous standards for safe and responsible outputs. Notably, Gemma surpasses significantly larger models on key benchmarks.
Gemma models can be fine-tuned on custom data to adapt to specific application needs, such as summarization or retrieval-augmented generation (RAG). The models support a wide variety of tools and systems, including multi-framework tools like Keras 3.0, PyTorch, JAX, and Hugging Face Transformers. Gemma models are compatible with various devices, including laptops, desktops, IoT, mobile, and cloud, enabling broadly accessible AI capabilities.
As Nvidia continues to push the boundaries of AI-powered chatbots, the latest updates to ChatRTX demonstrate the company's commitment to empowering RTX GPU owners with cutting-edge tools for data analysis and search. By leveraging a range of AI models and introducing voice query capabilities, Nvidia is making it easier for users to extract valuable insights from their personal data, whether it be documents, videos, or images. Also Google partnership with Nvidia to optimize Gemma for NVIDIA GPUs ensure industry-leading performance and integration with cutting-edge technology.