CALM demo embedding model bug with Cohere and Bedrock

alekseev · May 5, 2025, 2:55pm

Hello! I’m experimenting with the CALM demo bot and I found an issue. I provide cohere.embed-multilingual-v3 via Bedrock as an embedding model and when I run rasa train with it and the default config.yml from the demo repository (I also changed the LLM component to one of the bedrock models), I get the following error during the training of the IntentlessPolicy:

2025-05-05 15:57:49 ERROR    rasa.core.policies.intentless_policy  - [error    ] intentless_policy.train.llm.error error=ProviderClientAPIException('Failed to embed documents\nOriginal error: litellm.BadRequestError: BedrockException - {"message":"Malformed input request: #/texts: expected maximum item count: 128, found: 300, please reformat your input and try again."})')

I looked deeper into it and indeed, while performing an API call for the embedding model there is a list of 300 phrases that is provided to the embedding model - from what I understood these phrases are taken from domain/nlu_based, domain/_shared.yml, domain/search. Seems like most of them come from the Squad dataset domain/search/squad.yml.

I also tried with amazon.titan-embed-text-v2:0 embedding model on Bedrock and it seems like LiteLLM handles these models differently:

if I pass a list of phrases to litellm.embedding(model="amazon.titan-embed-text-v2:0", ...), under the hood it will make a separate request for each phrase.
while for cohere.embed-multilingual-v3, it will only make one request - and there is a upper limit on the number of elements to embed.

I just wanted to bring your attention to this particularity of cohere and maybe other embedding models that may affect embedding-related components - it may be worth it to split all documents into batches in embed() of shared.providers.embedding.embedding_client.EmbeddingClient.

Also, a question: am I understanding correctly that during the training IntentlessPolicy is importing all the responses in the domain yaml files that are in no flow?

Thank you!

m_ashurkina · May 9, 2025, 3:38pm

Hi @alekseev thank you for reporting this issue. This is a bug, and we are working to fix it.

Regarding your question:

Also, a question: am I understanding correctly that during the training 
IntentlessPolicy is importing all the responses in the domain yaml 
files that are in no flow?

Do I understand correctly that you want to know if the responses that are not used in any of the flows are also imported during the training?

Topic		Replies	Views
Integrate Cohere with Rasa Pro Rasa Pro CALM	5	303	May 14, 2024
Issue connecting RASA PRO CALM with Embedding model on self-hosted vllm server: litellm.BadRequestError: LLM Provider NOT provided Rasa Pro CALM	0	76	December 2, 2024
Getting error "[error ] llm command generator.llm.error error=AttributeError("module 'cohere' has no attribute 'error'")", how to resolve it Rasa Pro CALM	1	273	May 16, 2024
Minimal viable LLM for Command Generation? Rasa Pro CALM	6	143	October 7, 2024
Rasa Pro coexistence issue Rasa Pro CALM rasa_nlu , calm	2	29	May 7, 2025

CALM demo embedding model bug with Cohere and Bedrock

Related topics