Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SyntaxWarnings Due to Invalid Escape Sequences in DropTags.py #12

Closed
smohr opened this issue Feb 9, 2025 · 2 comments · Fixed by #13
Closed

SyntaxWarnings Due to Invalid Escape Sequences in DropTags.py #12

smohr opened this issue Feb 9, 2025 · 2 comments · Fixed by #13

Comments

@smohr
Copy link
Contributor

smohr commented Feb 9, 2025

When running OneTrainer, several SyntaxWarnings are raised because of invalid escape sequences in the DropTags.py module. These warnings indicate that certain escape sequences (e.g., \{, \), and \() are not properly formatted in Python string literals. This may lead to unintended behavior in regex processing.

Steps to Reproduce:

  1. Start OneTrainer

  2. Observe the following warnings in the console output:

    /home/sm/Applications/OneTrainer/venv/src/mgds/src/mgds/pipelineModules/DropTags.py:75: SyntaxWarning: invalid escape sequence '\{'
      regex_spchars = set(".^$*+?!\{\}\[\]|()\\")
    /home/sm/Applications/OneTrainer/venv/src/mgds/src/mgds/pipelineModules/DropTags.py:78: SyntaxWarning: invalid escape sequence '\)'
      c = c.replace("\)", "\\\\\)")
    /home/sm/Applications/OneTrainer/venv/src/mgds/src/mgds/pipelineModules/DropTags.py:79: SyntaxWarning: invalid escape sequence '\('
      c = c.replace("\(", "\\\\\(")   #hopefully fix issues caused by "\(\)" syntax without affecting other regex
    

Expected Behavior:

  • The application should run without any SyntaxWarnings.
  • Regex special characters should be correctly escaped so that regex operations perform as expected without any warnings.

Environment:

  • OS: Ubuntu Plucky Puffin (development branch)
  • Python: 3.12
Nerogar added a commit that referenced this issue Feb 15, 2025
…p-tags

fix: invalid escape sequences in DropTags module (#12)
@maedtb
Copy link

maedtb commented Mar 17, 2025

I think this change removed support for regex matching, as you are now calling re.escape on the entire pattern. This effectively makes all tag dropping use regular string matching, but in a much slower way because it uses the regex engine.

You should add "The application continues to support using regular expressions to match tags" to your expected behavior list, as not removing the feature you are fixing is important.

@smohr
Copy link
Contributor Author

smohr commented Mar 17, 2025

@maedtb Thanks for catching that. We should probably think about adding unit tests specifically to verify that regex-based matching continues to work as intended and doesn’t regress in future changes. Regarding the remark about the expected behavior list - I appreciate the feedback, but let’s keep the discussion focused on solving the issue rather than adding unnecessary puns.

please review if it works as intended with this pr #19

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants