
feat: translate instruction when adapting prompt #1529

Merged
2 commits merged Oct 19, 2024

Conversation

Yunnglin
Contributor

When adapting prompts, both the examples and the corresponding instructions are important and need to be translated.

@dosubot dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Oct 18, 2024
@shahules786 shahules786 self-requested a review October 18, 2024 10:26
@shahules786
Member

shahules786 commented Oct 18, 2024

Hey @Yunnglin Did you observe better results when translating both? We observed pretty good results with only translating a few shot examples. Another reason why we did not translate instruction was that time it introduces ambiguity. FOr example, let's say if the instruction was "Output False if a number greater than zero", this would also translate the word "False", which then causes issues while post-processing.

@Yunnglin
Contributor Author

Whether to translate the "instruction" could perhaps be made into an option. When I was generating a Chinese dataset, I found that some "extractor prompts" would lack examples, which led to the generated data not being very effective.

@shahules786
Member

@Yunnglin that suggestion makes sense. Could you make a PR with the same?
@jjmachan Please share if you have any opinion on this.

@jjmachan
Member

This makes a ton of sense.

What we can do is take another argument in

    async def adapt(
        self, target_language: str, llm: BaseRagasLLM
    ) -> "PydanticPrompt[InputModel, OutputModel]":
        """
        Adapt the prompt to a new language.
        """

that controls whether the instruction is converted, and expose it through adapt_prompts as well.

That would be really helpful 🙂

@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Oct 19, 2024
@Yunnglin
Contributor Author

  • Add an adapt_instruction: bool = False parameter.

    Now you can adapt prompts as follows:

        import asyncio

        from ragas.llms import LangchainLLMWrapper
        from ragas.metrics import Faithfulness

        # chat_model is your LangChain chat model instance
        instance = Faithfulness()
        adapted_prompts = asyncio.run(
            instance.adapt_prompts(
                language="chinese",
                llm=LangchainLLMWrapper(chat_model),
                adapt_instruction=True,
            )
        )
        print(adapted_prompts)

  • Add "Ensure that the number of output data rows is equal to the number of input data rows." to the translation prompt, since the LLM sometimes breaks a single line into multiple lines.
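The row-count concern in the second bullet can be sketched as a simple guard (a hypothetical helper, not code from this PR):

```python
def check_row_count(src_lines, translated_lines):
    """Reject a translation whose row count differs from the source.

    This mirrors the constraint added to the translation prompt: if the
    LLM splits one input line into several output lines, the rows can no
    longer be matched back to their sources.
    """
    if len(translated_lines) != len(src_lines):
        raise ValueError(
            f"translation produced {len(translated_lines)} rows, "
            f"expected {len(src_lines)}"
        )
    return translated_lines
```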

@shahules786 shahules786 left a comment
Member


LGTM, thanks a lot.

@jjmachan jjmachan merged commit 5481246 into explodinggradients:main Oct 19, 2024
16 checks passed