Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: proportion_in_target_languages not considering tokens present in… #114

Merged

Conversation

juanjoDiaz
Copy link
Collaborator

… several languages.

Closes #112

@codecov
Copy link

codecov bot commented Sep 12, 2023

Codecov Report

Merging #114 (d9514a4) into main (fa1d964) will increase coverage by 0.05%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #114      +/-   ##
==========================================
+ Coverage   96.62%   96.67%   +0.05%     
==========================================
  Files          33       33              
  Lines         651      661      +10     
==========================================
+ Hits          629      639      +10     
  Misses         22       22              
Files Changed Coverage Δ
simplemma/language_detector.py 97.22% <100.00%> (+0.44%) ⬆️

@adbar
Copy link
Owner

adbar commented Sep 15, 2023

Thanks!

@adbar adbar merged commit 75561b6 into adbar:main Sep 15, 2023
15 checks passed
== in_target_language(text, lang=langs)
== 1.0
)
in_target_language("It was a true gift", lang=("en", "de"))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

in_target_language can count words twice and return ratios above 1.0
3 participants