Skip to content

Lowercase input #134

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
jasperfunksmit opened this issue Jan 4, 2023 · 1 comment
Closed

Lowercase input #134

jasperfunksmit opened this issue Jan 4, 2023 · 1 comment

Comments

@jasperfunksmit
Copy link

This is an excellent tool. One issue i noticed though: The finder returns a result for "Canis lupus", but none for "canis lupus". Is it possible to circumvent this?

@dimus
Copy link
Member

dimus commented Feb 22, 2023

thank you @git-arbitrarysystems! Unfortunately it is not currently possible to find low-case scientific names. There are two reasons for this:

  1. False positives. Scientific names sometimes are the same as "normal" words. For example, such genera as 'America', 'Cafeteria', 'Cancer'. GNfinder tries to strike a balance between finding names and missing name-like words. By rules of nomenclature generic part of the name must be capitalized, and GNfinder follows this rule to avoid multi-fold increase in false positives.
  2. The speed of name-finding would decrease significantly as well. Now the app has to check every capitalized word. If it would try to check every word, it would affect the speed. Speed is important, as we use the app to traverse billions of pages.

Because of large number of false positives and decrease of performance, finding names with non-capitalized genera would not be a feature we want to implement. With advent of AI and machine learning we might be able to address this issue, but that would require a complete rethink/rewrite of the app.

@dimus dimus closed this as completed Feb 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants