To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?垣兢?肯?怨峰?淨?垣兢?肯?怨峰?^ 100111111100010000111111100010100101111110011001010111010011111110001101011011010011111110001001100001011001010111110100001111111001111111000100001111111000101001011111100110010101110100111111100011010110110100111111100010011000010110010101111101000011111101011110 9fc43f8a5f995d3f8d6d3f898595f43f9fc43f8a5f995d3f8d6d3f898595f43f5e
EUC-JP 淨?垣兢?肯?怨峰汶淨?垣兢?肯?怨峰汶^ 11011110110001100011111110110011110000001101000110111110001111111011100111001110001111111011000111100101110010101111011010001111110001101110010111011110110001100011111110110011110000001101000110111110001111111011100111001110001111111011000111100101110010101111011010001111110001101110010101011110 dec63fb3c0d1be3fb9ce3fb1e5caf68fc6e5dec63fb3c0d1be3fb9ce3fb1e5caf68fc6e55e
UTF-8 淨렠垣兢렚肯떵怨峰汶淨렠垣兢렚肯떵怨峰汶^ 11100110101101111010100011101011101000001010000011100101100111101010001111100101100001011010001011101011101000001001101011101000100000101010111111101011100101101011010111100110100000001010100011100101101100111011000011100110101100011011011011100110101101111010100011101011101000001010000011100101100111101010001111100101100001011010001011101011101000001001101011101000100000101010111111101011100101101011010111100110100000001010100011100101101100111011000011100110101100011011011001011110 e6b7a8eba0a0e59ea3e585a2eba09ae882afeb96b5e680a8e5b3b0e6b1b6e6b7a8eba0a0e59ea3e585a2eba09ae882afeb96b5e680a8e5b3b0e6b1b65e
UHC 淨렠垣兢렚肯떵怨峰汶淨렠垣兢렚肯떵怨峰汶^ 1110111111100100100011101011000111101010101011111101000011100111100011101010110111010000111010011011011010111010111010101011001111011100111010001101101010100001111011111110010010001110101100011110101010101111110100001110011110001110101011011101000011101001101101101011101011101010101100111101110011101000110110101010000101011110 efe48eb1eaafd0e78eadd0e9b6baeab3dce8daa1efe48eb1eaafd0e78eadd0e9b6baeab3dce8daa15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)