Google has introduced a tool that simplifies the setup of retrieval augmented generation (RAG) pipelines for enterprises. The File Search Tool, part of Google’s Gemini API, abstracts the retrieval pipeline, eliminating the need for complex engineering tasks such as storage solutions and embedding creators. This tool offers a more standalone and less orchestrated solution compared to similar products from OpenAI, AWS, and Microsoft.
File Search leverages Google’s Gemini Embedding model, known for its high performance on the Massive Text Embedding Benchmark. By handling file storage, chunking strategies, and embeddings, File Search streamlines the complexities of RAG, making it easier for developers to integrate within existing APIs.
Using vector search, File Search can understand query context and provide accurate responses even with inexact search terms. It supports various file formats and includes built-in citations for transparency and verification. Enterprises can access certain features for free initially, with indexing fees set at $0.15 per 1 million tokens.
While other platforms like OpenAI’s Assistants API and AWS’s Bedrock offer similar functionalities, Google’s File Search abstracts the entire RAG pipeline creation process, enhancing efficiency and productivity for users. Phaser Studio, a game generation platform, reported significant time savings and improved productivity using File Search.
Source: VentureBeat