This repository was archived by the owner on Jun 1, 2022. It is now read-only.

Description
Grab some sample files that contain UTF-8 characters.
also include fields in the XML that are CDATA + UTF-8 in them. (which technically means: don't convert but use these exact chars as plain text).
e.g. <headline><![CDATA[Location, quality and tranquility – a hidden gem in the blue chip suburb of kew]]></headline>