I am requesting the addition of support for the OpenAI Batch API within the Spring AI framework.
The OpenAI Batch API is designed for efficient, asynchronous processing of large volumes of AI requests. Integrating this functionality into Spring AI would provide significant advantages for developers, enabling them to:
-
Process high volumes of requests cost-effectively: The Batch API offers a 50% cost discount compared to synchronous API calls.
-
Overcome rate limit constraints: It utilizes separate, significantly higher rate limits, allowing for much greater throughput without impacting real-time application performance.
-
Manage large-scale AI tasks efficiently: Batches are guaranteed to complete within 24 hours, making it suitable for operations where immediate responses are not critical.
This feature would be particularly beneficial for use cases such as:
-
Running evaluations: Automating the assessment of models or data.
-
Classifying large datasets: Efficiently categorizing extensive collections of text or other content.
-
Embedding content repositories: Generating embeddings for vast amounts of data for retrieval-augmented generation (RAG) or other applications.
By supporting the OpenAI Batch API, Spring AI would greatly enhance its capabilities for building robust and scalable AI applications that handle large-scale, asynchronous AI workloads.
I gathered this information from the official OpenAI documentation on their Batch API guide.