The documentation says "\w any character that matches \p{L}, \p{N}, \p{Mn}, or \p{Pc}", but I believe [Mc](https://www.fileformat.info/info/unicode/category/Mc/list.htm) (spacing combining marks) should be considered word characters as well.
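To make this concrete, here is a rough sketch in Python that uses only the standard `unicodedata` module and not the regex engine in question. The helper `is_documented_word_char` is a hypothetical name of mine; it just mirrors the category list quoted above and is not the library's actual implementation. It shows how a spacing combining mark in the middle of an ordinary word falls outside that list:

```python
import unicodedata

# Categories named in the quoted documentation for \w:
# the major classes L and N, plus the exact categories Mn and Pc.
DOCUMENTED_MAJOR = {"L", "N"}
DOCUMENTED_EXACT = {"Mn", "Pc"}


def is_documented_word_char(ch: str) -> bool:
    """Hypothetical helper: True if ch is in a category the docs list for \\w."""
    cat = unicodedata.category(ch)
    return cat[0] in DOCUMENTED_MAJOR or cat in DOCUMENTED_EXACT


# DEVANAGARI LETTER KA (U+0915) followed by DEVANAGARI VOWEL SIGN AA (U+093E).
word = "\u0915\u093E"

for ch in word:
    print(f"U+{ord(ch):04X}", unicodedata.category(ch), is_documented_word_char(ch))
```

Under the documented definition the letter (category Lo) counts as a word character, but the vowel sign attached to it (category Mc) does not, so a `\w+` match built on that definition would presumably stop partway through the word.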