Elastic Introduces Native Inference Service in Elastic Cloud

MWN-AI** Summary

Elastic, a leader in search technology, has unveiled its new Elastic Inference Service (EIS), a GPU-accelerated inference-as-a-service designed for use within the Elastic Cloud. This service aims to enhance Elasticsearch's capabilities for semantic search, vector search, and generative AI applications by streamlining inference processes critical to these technologies.

As the demand for generative AI and vector search solutions grows, Elastic recognizes the challenges of managing infrastructure, testing models, and integrating systems, which can hinder team efficiency. The EIS addresses these issues by offering an API-based service powered by NVIDIA GPUs, allowing low-latency and high-throughput inference that significantly boosts performance. According to Steve Kearns, General Manager of Search at Elastic, "Inference at scale is incredibly important for vector search, semantic search and GenAI workflows," and EIS meets this need.

In its technical preview, EIS features the Elastic Learned Sparse Encoder (ELSER), a text-embedding model that provides state-of-the-art search relevance. Future updates will introduce additional models for multilingual embeddings and enhancements from Elastic’s recent acquisition of Jina.

Key benefits of EIS include a streamlined developer experience with integrated workflows, improved semantic search capabilities with support for various vector types, and simplified generative AI implementations. The service also ensures backward compatibility with existing Elasticsearch ML Nodes and offers GPU performance that promises up to 10 times greater throughput compared to CPU-based approaches.

Elastic EIS operates on both Serverless and Elastic Cloud Hosted deployments, accessed by all cloud service providers and regions, with transparent consumption-based pricing. This innovation not only amplifies Elastic's product offerings but reinforces its commitment to providing advanced AI-driven search solutions.

MWN-AI** Analysis

Elastic's recent launch of the Elastic Inference Service (EIS) marks a significant advancement in operational efficiency for enterprises navigating the complexities of generative AI and vector search applications. This GPU-accelerated inference-as-a-service model positions Elastic favorably within the competitive landscape of cloud-based AI solutions, particularly in an environment where speed and scalability are crucial.

The introduction of EIS streamlines the developer experience by eliminating the need for cumbersome model downloads and manual configurations. This is particularly valuable for organizations that may lack extensive technical resources or expertise, enabling them to focus their efforts on core business objectives rather than infrastructure management. The API-based integration with existing Elastic offerings promises improved end-to-end semantic search capabilities, facilitating enhanced retrieval accuracy and relevance in a data-driven era.

Investors should take note of the strategic implications of EIS within Elastic’s business model. The sheer potential for operational cost savings through GPU acceleration—reported to yield up to a 10x higher throughput compared to CPU alternatives—is compelling. This could attract a broader customer base, particularly among organizations leveraging AI for real-time data analytics and insights. Moreover, the predictable consumption-based pricing model aligns with market trends favoring flexibility and financial accountability.

The successful rollout of EIS also represents a timely response to growing market demands for integrated AI capabilities. Elastic’s ability to support complex workflows with minimal friction, alongside robust backward compatibility, enhances its value proposition in a rapidly evolving competitive landscape.

With a well-established presence in the market (catering to over 50% of Fortune 500 companies), this advancement could stimulate robust growth for Elastic (NYSE: ESTC) in Q4 2023 and beyond. Now might be an ideal time for investors to evaluate potential positions in Elastic as EIS gains traction and additional models are anticipated.

**MWN-AI Summary and Analysis is based on asking OpenAI to summarize and analyze this news release.

October 09, 2025 11:02:00 am

Source: Business Wire

New service to provide GPU-accelerated embedding and retrieval models

Elastic (NYSE: ESTC), the Search AI Company, today announced the Elastic Inference Service (EIS), a GPU-accelerated inference-as-a-service for Elasticsearch semantic search, vector search, and generative AI workflows.

Every generative AI and vector search application relies on inference, and Elastic now delivers these capabilities natively as part of Elastic Cloud. As volumes grow, managing infrastructure, testing models, and handling integrations creates operational overhead that slows teams down. This has created a need for GPU-acceleration and an integrated workflow to provide speed, scalability, and cost efficiency.

“Inference at scale is incredibly important for vector search, semantic search and GenAI workflows,” said Steve Kearns, General Manager, Search at Elastic . “The Elastic Inference Service meets that challenge by providing our customers with an API-based inference service using NVIDIA GPUs with our best-in-class Elasticsearch vector database for low-latency, high-throughput inference.”

Elastic Learned Sparse EncodeR (ELSER) — Elastic’s built-in sparse vector model for state-of-the-art search relevance — is the first text-embedding model available on EIS in technical preview. Support for additional models for multilingual embeddings, reranking, and models from the recently announced Jina acquisition, will be available soon.

Some key benefits for developers who use EIS include:

Streamlined developer experience : No model downloads, manual configuration, or resource provisioning. EIS integrates directly with semantic_text and the Inference API for a seamless developer experience.
Improved end-to-end semantic search experience : EIS is compatible with sparse vectors, dense vectors, or semantic reranking.
Simplified generative AI workflows : AI features for ingest, investigation, detection, and analysis work out of the box, reducing the friction of contracts, API keys, and external services.
Backward compatibility : The Open Inference API gives users full flexibility to connect any third-party service, while existing Elasticsearch ML Nodes remain supported during adoption.
Enhanced performance : GPU-accelerated inference provides consistent latency and up to 10x higher throughput for ingest compared to CPU-based alternatives.
Easy to understand pricing : EIS provides consumption-based pricing similar to other inference services, charged per model per million tokens. It is also easy to get started and access support.
Peace of mind : Elastic also provides an intellectual property indemnity for all models provided on EIS.

For additional information on the Elastic Inference Service, read the Elastic blog .

Availability

The Elastic Inference Service is available to use on Serverless and Elastic Cloud Hosted deployments. All CSPs and regions can access the inference endpoints on EIS.

Additional models will be available soon to support a wider variety of search and inference needs.

About Elastic

Elastic (NYSE: ESTC), the Search AI Company, integrates its deep expertise in search technology with artificial intelligence to help everyone transform all of their data into answers, actions, and outcomes. Elastic's Search AI Platform — the foundation for its search, observability, and security solutions — is used by thousands of companies, including more than 50% of the Fortune 500. Learn more at elastic.co .

Elastic and associated marks are trademarks or registered trademarks of elasticsearch BV and its subsidiaries. All other company and product names may be trademarks of their respective owners. The release and timing of any features such as the additional models and region availability or functionality described in this post remain at Elastic's sole discretion. Any features or functionality not currently available may not be delivered on time or at all.

View source version on businesswire.com: https://www.businesswire.com/news/home/20251009383846/en/

Media Contact
Elastic PR
PR-team@elastic.co

FAQ**

How does the launch of the Elastic Inference Service (EIS) enhance Elastic N.V. ESTC's competitive edge in the rapidly evolving AI and semantic search market?

The launch of Elastic Inference Service (EIS) enhances Elastic N.V. (ESTC)'s competitive edge by enabling efficient AI model deployment and scaling, thus improving performance and reducing costs in semantic search applications, positioning the company favorably in a growing market.

What strategic benefits does Elastic N.V. ESTC anticipate from integrating NVIDIA GPUs in EIS for performance improvements in vector search and generative AI workflows?

Elastic N.V. expects that integrating NVIDIA GPUs into its EIS will enhance performance in vector search and generative AI workflows, leading to faster processing, improved scalability, and more efficient resource utilization, ultimately driving customer satisfaction and market competitiveness.

How does Elastic N.V. ESTC plan to expand the range of models available on EIS to meet diverse customer needs and enhance user experience?

Elastic N.V. (ESTC) plans to expand the range of models on the Elastic Cloud by leveraging customer feedback, integrating advanced AI and machine learning capabilities, and enhancing collaboration with developers to ensure a broader, more tailored user experience.

What measures are being implemented by Elastic N.V. ESTC to ensure the security and reliability of EIS, especially considering its API-based infrastructure?

Elastic N.V. (ESTC) implements robust security measures for its EIS by employing advanced encryption, strict authentication protocols, regular security audits, and continuous monitoring to safeguard its API-based infrastructure against vulnerabilities and ensure reliability.

**MWN-AI FAQ is based on asking OpenAI questions about Elastic N.V. (NYSE: ESTC).

Elastic Introduces Native Inference Service in Elastic Cloud

MWN-AI** Summary

MWN-AI** Analysis

FAQ**

How does the launch of the Elastic Inference Service (EIS) enhance Elastic N.V. ESTC's competitive edge in the rapidly evolving AI and semantic search market?

What strategic benefits does Elastic N.V. ESTC anticipate from integrating NVIDIA GPUs in EIS for performance improvements in vector search and generative AI workflows?

How does Elastic N.V. ESTC plan to expand the range of models available on EIS to meet diverse customer needs and enhance user experience?

What measures are being implemented by Elastic N.V. ESTC to ensure the security and reliability of EIS, especially considering its API-based infrastructure?

NASDAQ: ESTC

ESTC Trading

ESTC Latest News

ESTC Stock Data

Subscribe to Our Newsletter

Link Market Wire News to Your X Account

Download The Market Wire News App