
Add NVIDIA support to Inference Plugin #132388


Draft
Jan-Kazlouski-elastic wants to merge 3 commits into main

Conversation

Jan-Kazlouski-elastic (Contributor)
Adds a new NVIDIA inference provider integration that allows completion (both streaming and non-streaming) and chat_completion (streaming only) tasks to be executed through the inference API.
This is a draft PR; the rerank and text_embedding tasks are yet to be added.
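For context, a minimal sketch of how a completion endpoint backed by this provider might be created and invoked through the Elasticsearch `_inference` REST API, using only the JDK HTTP client. The service name `nvidia`, the `service_settings` fields, the inference ID `nvidia-completion`, and the local `localhost:9200` cluster are assumptions for illustration, not confirmed by this PR.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class NvidiaInferenceExample {
    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();

        // Create the inference endpoint. The "nvidia" service name and the
        // service_settings fields below are hypothetical placeholders.
        String createBody = """
            {
              "service": "nvidia",
              "service_settings": {
                "api_key": "<NVIDIA_API_KEY>",
                "model_id": "<model-id>"
              }
            }
            """;
        HttpRequest create = HttpRequest.newBuilder()
            .uri(URI.create("http://localhost:9200/_inference/completion/nvidia-completion"))
            .header("Content-Type", "application/json")
            .PUT(HttpRequest.BodyPublishers.ofString(createBody))
            .build();
        System.out.println(client.send(create, HttpResponse.BodyHandlers.ofString()).body());

        // Run a non-streaming completion request against the new endpoint.
        String inferBody = """
            {"input": "Explain vector search in one sentence."}
            """;
        HttpRequest infer = HttpRequest.newBuilder()
            .uri(URI.create("http://localhost:9200/_inference/completion/nvidia-completion"))
            .header("Content-Type", "application/json")
            .POST(HttpRequest.BodyPublishers.ofString(inferBody))
            .build();
        System.out.println(client.send(infer, HttpResponse.BodyHandlers.ofString()).body());
    }
}
```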

…ation

# Conflicts:
#	server/src/main/java/org/elasticsearch/TransportVersions.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferenceNamedWriteablesProvider.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferencePlugin.java
elasticsearchmachine added the needs:triage (Requires assignment of a team area label), v9.2.0, and external-contributor (Pull request authored by a developer outside the Elasticsearch team) labels on Aug 4, 2025
Jan-Kazlouski-elastic marked this pull request as draft on August 4, 2025 10:01