Fix handling of empty and multi-byte character literals #2214

mfelsche · 2017-09-05T11:31:53Z

This PR will treat empty character literals as invalid syntax
and handle weird utf8 character literals like '🎠' correctly.

The root cause was the casting from char to int which created ints like 0xFFFFFF9F for a lexer char \x9F. This was treated as negative and thus ignored by the lexer.

This fixes #2198 in that it treats multi-byte input correctly byte by byte.

Praetonus · 2017-09-05T16:12:10Z

Thank you!

The build failure looks unrelated, I've restarted it for good measure.

Fix handling of empty and utf8 character literals

929018c

Praetonus added the changelog - fixed Automatically add "Fixed" CHANGELOG entry on merge label Sep 5, 2017

jemc changed the title ~~Fix handling of empty and utf8 character literals~~ Fix handling of empty and multi-byte character literals Sep 5, 2017

SeanTAllen merged commit 219d54e into ponylang:master Sep 6, 2017

ponylang-main added a commit that referenced this pull request Sep 6, 2017

Update CHANGELOG for PR #2214 [skip ci]

de31b17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix handling of empty and multi-byte character literals #2214

Fix handling of empty and multi-byte character literals #2214

mfelsche commented Sep 5, 2017 •

edited

Loading

Praetonus commented Sep 5, 2017

Fix handling of empty and multi-byte character literals #2214

Fix handling of empty and multi-byte character literals #2214

Conversation

mfelsche commented Sep 5, 2017 • edited Loading

Praetonus commented Sep 5, 2017

mfelsche commented Sep 5, 2017 •

edited

Loading