Shoot... wrote a reply and then closed the tab without posting. No back button for that.
So anyway, this seems like a terrible idea.
I've had my eye on these issues for almost 15 years now
https://www.webmasterworld.com/forum88/2256.htmhttps://www.webmasterworld.com/forum88/3543.htmWe've come a looonnng way in the interim, but I still see related issues.
Mixing character encodings break things. Technically, if you are using an extended character set, it's not a URL or a URI, it's an IRI (Internationalized Resource Identifier). That might help searching for info, but the RFC is here
http://www.ietf.org/rfc/rfc3987.txtAmong other things, it states: "For example, should one IRI be stored in a byte array in UTF-8 encoding form and the second in a UTF-16 encoding form, bit-for-bit comparisons applied naively will produce errors."
And just in general, are sure you will always be using UTF-8? Or UTF-8 BOM? Or UTF-16? Or Win-1252 which is " used by default in the legacy components of Microsoft Windows in English" (
https://en.wikipedia.org/wiki/Windows-1252)? Are you sure that the web surfer will not be using any of those legacy Windows components? Because if he is and pastes your URL into an email and sends it, it will break?
Are you sure that your encoding in your headers will match the declaration on the page? Because if they are not the same, your URLs might break.
Also, what shows in your browser bar may not match the URL if you are using percent encoding as some browsers render it and others do not
http://stackoverflow.com/questions/29988255/browser-support-for-utf8-encoded-characters-in-urlsIs the server set to use double-byte encoding? That might affect things too.
https://www.ibm.com/support/knowledgecenter/ko/SSELE6_8.0.0.4/com.ibm.isam.doc_8.0.0.4/wrp_config/concept/con_utf8_sup_url.htmlAnd if you do end up percent encoding, you have to make sure all urls are encoded/decoded (which you should be doing anyway) and take that into account for cutting and pasting and so forth.
Your best case with percent encoding is that the URL is
http://example.com/super-product%E2%84%A2This seems like a path to pain for no good reason except vanity. The client is so proud of their trademark, they even want it in their URL, just like Ford and Microsoft and Tesla and Apple do with their URLs. Oh wait, reality check, NONE of them do that with their URLs. Is it because they don't have the resources to do so?