Hi,
I’m new to Rasa Pro and I want to use Hugging Face for the LLM instead of OpenAI.
I’m getting the following error when I run the assistant:
C:\Users\chandrasekhar.m\AppData\Local\anaconda3\envs\plus\lib\site-packages\huggingface_hub\utils\_deprecation.py:131: FutureWarning: 'InferenceApi' (from 'huggingface_hub.inference_api') is deprecated and will be removed from version '1.0'. InferenceApi client is deprecated in favor of the more feature-complete InferenceClient. Check out this guide to learn how to convert your script to use it: Run Inference on servers.
  warnings.warn(warning_message, FutureWarning)
2024-03-27 14:29:49 ERROR rasa.utils.log_utils - [error ] llm_command_generator.llm.error error=ValueError('Error raised by inference API: Request failed during generation: Server error: Out of available cache blocks: asked 52, only 11 free blocks')
C:\Users\chandrasekhar.m\AppData\Local\anaconda3\envs\plus\lib\site-packages\sanic\server\websockets\impl.py:521: DeprecationWarning: The explicit passing of coroutine objects to asyncio.wait() is deprecated since Python 3.8, and scheduled for removal in Python 3.11.
  done, pending = await asyncio.wait(
C:\Users\chandrasekhar.m\AppData\Local\anaconda3\envs\plus\lib\site-packages\huggingface_hub\utils\_deprecation.py:131: FutureWarning: 'InferenceApi' (from 'huggingface_hub.inference_api') is deprecated and will be removed from version '1.0'. InferenceApi client is deprecated in favor of the more feature-complete InferenceClient. Check out this guide to learn how to convert your script to use it: Run Inference on servers.
  warnings.warn(warning_message, FutureWarning)
2024-03-27 14:31:56 ERROR rasa.utils.log_utils - [error ] llm_command_generator.llm.error error=ValueError('Error raised by inference API: Request failed during generation: Server error: Out of available cache blocks: asked 54, only 8 free blocks')
C:\Users\chandrasekhar.m\AppData\Local\anaconda3\envs\plus\lib\site-packages\sanic\server\websockets\impl.py:521: DeprecationWarning: The explicit passing of coroutine objects to asyncio.wait() is deprecated since Python 3.8, and scheduled for removal in Python 3.11.
  done, pending = await asyncio.wait(
Please help me understand how to use Hugging Face as the LLM.