What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 枳???鄭?姐捧? 10011110011010110011111100111111001111111001001101000001001111111000100010110111100101011111100100111111 9e6b3f3f3f93413f88b795f93f
EUC-JP 枳?雩?鄭?姐捧? 110110111100110000111111100011111110011011111010001111111100010110100010001111111011000010111001110010101111101100111111 dbcc3f8fe6fa3fc5a23fb0b9cafb3f
UTF-8 枳렟雩렮鄭렩姐捧쒀 111001101001111010110011111010111010000010011111111010011001101110101001111010111010000010101110111010011000010010101101111010111010000010101001111001011010011110010000111001101000110110100111111011001001001010000000 e69eb3eba09fe99ba9eba0aee984adeba0a9e5a790e68da7ec9280
UHC 枳렟雩렮鄭렩姐捧쒀 111100101010110010001110101100001110100111101100100011101011101111101111111101111000111010110111111011101011101111011100111010011011111010101100 f2ac8eb0e9ec8ebbeff78eb7eebbdce9beac

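SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f), as in the ISO-8859-1 row.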
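
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name `encode_report` is hypothetical, and the codec names are assumptions about which Python codecs correspond to the listed charsets (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC). Unencodable characters are replaced with "?" (0x3f), which is consistent with the rows above, though individual code points may still map differently than they do in the tool.

```python
# Hypothetical helper: encode a string in several charsets and
# return the binary and hexadecimal form of the resulting bytes.
DEFAULT_CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=DEFAULT_CHARSETS):
    rows = []
    for name in charsets:
        # errors="replace" substitutes b"?" (0x3f) for any character
        # the target charset cannot represent.
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("A"):
        print(name, bits, hexstr)
```

Calling `encode_report("枳", charsets=("utf-8",))` yields the hex string `e69eb3`, which matches the first three bytes of the UTF-8 row in the table.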