Restore model registry validation for the semantic text field #127285

jimczi · 2025-04-23T20:57:13Z

This PR restores the changes reverted in #127075 and fixes the bug that caused the test failures.
The resolved model is now used only for validation when the semantic text field is created;
the final value is set on the first ingestion to ensure consistency.
This change also sets the default for semantic_text fields that use dense vectors to BBQ.

This reverts commit e280aa5.

elasticsearchmachine · 2025-04-23T20:57:47Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

elasticsearchmachine · 2025-04-23T20:57:48Z

Hi @jimczi, I've created a changelog YAML for you.

kderusso

Thanks for putting this PR together. I'm not sure I quite understand based on the commit history what was causing this bug - it looks like the changes after the revert were pretty minimal. Would it be possible to walk me through how this fixed the original test failures so I can better understand if we want to proactively add more tests? Thanks!

...resources/rest-api-spec/test/inference/50_semantic_text_query_inference_endpoint_changes.yml

jimczi · 2025-04-24T12:05:42Z

> Would it be possible to walk me through how this fixed the original test failures so I can better understand if we want to proactively add more tests?

The issue comes up when an inference endpoint gets created after the semantic field that points to it. In that case, the model only gets resolved when the mapping is updated. What’s happening in this bug is that the first ingestion triggers that update. We get the model definition from the bulk request and try to do a dynamic mapping update.

Dynamic updates run on the master node. There, the master parses the current mapping (which doesn’t have the model settings yet) and merges it with the update. But because the model was already resolved via the registry during that initial parse, the merge sees the update as a no-op, which isn’t allowed during bulk requests.

The fix is to skip setting the model settings in the mapping if the model was only resolved through the registry. We still use the resolved model to build sub-fields and check the settings, but we don’t actually include it in the field mapping just yet. Then, when the first ingestion happens, the model settings get properly added through that dynamic update.
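To make the sequence above concrete, here is a minimal toy model of the failure and the fix. It is not Elasticsearch code: `ModelSettings`, `SemanticTextField`, `parse_field`, `apply_dynamic_update`, and the `persist_registry_model` flag are all hypothetical names invented for this sketch, and the no-op check is reduced to a simple equality test.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class ModelSettings:
    # Hypothetical, simplified stand-in for the resolved model definition.
    task_type: str
    dimensions: int


@dataclass(frozen=True)
class SemanticTextField:
    # Hypothetical, simplified stand-in for a parsed semantic text field.
    inference_id: str
    model_settings: Optional[ModelSettings] = None


def parse_field(raw: dict, registry: Dict[str, ModelSettings],
                persist_registry_model: bool) -> SemanticTextField:
    """Parse a stored mapping entry into a field object."""
    explicit = raw.get("model_settings")
    if explicit is not None:
        # Settings already stored in the mapping are always kept.
        return SemanticTextField(raw["inference_id"], ModelSettings(**explicit))
    resolved = registry.get(raw["inference_id"])
    if resolved is not None and persist_registry_model:
        # Behaviour described as the bug: the registry-resolved model leaks
        # into the parsed field as if it were part of the stored mapping.
        return SemanticTextField(raw["inference_id"], resolved)
    # Behaviour described as the fix: the resolved model may still be used
    # for validation and sub-fields, but it is not set on the field.
    return SemanticTextField(raw["inference_id"], None)


def apply_dynamic_update(raw_current: dict, update: SemanticTextField,
                         registry: Dict[str, ModelSettings],
                         persist_registry_model: bool) -> SemanticTextField:
    """Re-parse the current mapping, merge the update, and reject no-ops."""
    current = parse_field(raw_current, registry, persist_registry_model)
    settings = (current.model_settings
                if current.model_settings is not None
                else update.model_settings)
    merged = SemanticTextField(current.inference_id, settings)
    if merged == current:
        # Assumption: a no-op dynamic update is modelled as an error here.
        raise ValueError("dynamic mapping update is a no-op")
    return merged
```

With `persist_registry_model=True`, a first-ingestion update that carries the same settings as the registry raises, because the re-parsed mapping already looks complete. With `persist_registry_model=False`, the same update is applied and the settings land in the mapping.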

kderusso

Changes look reasonable to me, and I haven't heard any feedback in the channel w.r.t. adding additional tests, so approving. Thanks for the fix!

Mikep86

Found the new test I was looking for. Small nit about the test location.

...resources/rest-api-spec/test/inference/50_semantic_text_query_inference_endpoint_changes.yml

...e/src/yamlRestTest/resources/rest-api-spec/test/inference/30_semantic_text_inference_bwc.yml

jimczi · 2025-04-29T08:11:06Z

@elasticmachine run elasticsearch-ci/part-3

jimczi · 2025-04-29T08:11:18Z

@elasticmachine run elasticsearch-ci

This PR is a partial backport of elastic#127285 that fixes the validation of the inference id when mappings are restored or dynamically updated. This change doesn't include defaulting semantic text dense vector to BBQ since it requires elastic#124581 to be backported first.

jimczi added 2 commits April 23, 2025 21:41

Revert "Revert semantic_text model registry changes (elastic#127075)"

6dd5e59

This reverts commit e280aa5.

Merge remote-tracking branch 'upstream/main' into model_registry_reve…

9798838

…rt_back

jimczi added the >enhancement, :Search Relevance/Search, v9.0.0, and v8.19.0 labels Apr 23, 2025

jimczi requested a review from kderusso April 23, 2025 20:57

jimczi added v9.1.0 and removed v9.0.0 labels Apr 23, 2025

elasticsearchmachine added the Team:Search Relevance label Apr 23, 2025

Update docs/changelog/127285.yaml

a4e02f3

kderusso reviewed Apr 24, 2025

...resources/rest-api-spec/test/inference/50_semantic_text_query_inference_endpoint_changes.yml (outdated, resolved)

Merge branch 'main' into model_registry_revert_back

7c6a8b4

kderusso approved these changes Apr 28, 2025

Mikep86 approved these changes Apr 28, 2025

...resources/rest-api-spec/test/inference/50_semantic_text_query_inference_endpoint_changes.yml (outdated, resolved)

address review comment

c8a2fce

Mikep86 reviewed Apr 28, 2025

...e/src/yamlRestTest/resources/rest-api-spec/test/inference/30_semantic_text_inference_bwc.yml (outdated, resolved)

jimczi added 3 commits April 28, 2025 15:01

add legacy format option

eddac44

fix schema

be3d7db

Merge remote-tracking branch 'upstream/main' into model_registry_reve…

ab741e7

…rt_back

jimczi merged commit 85d375c into elastic:main Apr 29, 2025
17 checks passed

jimczi deleted the model_registry_revert_back branch April 29, 2025 09:36

jimczi added the backport pending label Apr 30, 2025

jimczi mentioned this pull request Apr 30, 2025

[8.19] Fix inference model validation for the semantic text field #127559

Merged

jimczi removed the backport pending label Jun 9, 2025
