Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support content in languages other than English #696

Open
natoverse opened this issue Jul 25, 2024 · 1 comment
Open

Support content in languages other than English #696

natoverse opened this issue Jul 25, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@natoverse
Copy link
Collaborator

natoverse commented Jul 25, 2024

GraphRAG does not explicitly support any particular language, however, the prompts are written in English and most of our evaluation has been done using English-language datasets. Many users would like to use GraphRAG for non-English datasets, and have reported varying levels of success. GraphRAG performance may vary across languages based on prompting, encoding/tokenizing, and the training and biases of the chosen model.

While we don't plan to implement explicit features or support for any language in particular at this time, there are a number of things users can do to try and improve non-English language support. A few examples:

  • Tune the prompts to request responses in a specific language. Notably, you can use our auto-tuning CLI tool and specify the language to use.
  • Rewrite the prompts in your language of choice. If you have used the init command to generate your starting config, all of the prompts are exported as text files that you can edit.
  • Experiment with different models. We haven't confirmed any specific model/language alignments, but please do experiment and report back in the discussion comments if you have helpful results. If you need to try a non-OpenAI model that is trained on a language other than English, please see issue Support model providers other than OpenAI and Azure #657.
@natoverse
Copy link
Collaborator Author

Some helpful content for users with Chinese content could be here: #596

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant