To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???橈??歪??閻? 0011111100111111001111111001111011110100001111110011111110011000011000110011111100111111111010001000010100111111 3f3f3f9ef43f3f98633f3fe8853f
EUC-JP ???橈??歪??閻? 0011111100111111001111111101110011110110001111110011111111001111110001000011111100111111111011111110010100111111 3f3f3fdcf63f3fcfc43f3fefe53f
UTF-8 了묋뮅橈볩풄歪뉓구閻뚩 111011111010011010111010111010111010110010001011111010111010111010000101111001101010100110001000111010111011001110101001111011011001001010000100111001101010110110101010111010111000100110010011111010101011010110101100111010011001011010111011111010111001101010101001 efa6baebac8bebae85e6a988ebb3a9ed9284e6adaaeb8993eab5ace996bbeb9aa9
UHC 了묋뮅橈볩풄歪뉓구閻뚩 11101000111001111001000111101000100100101001010011101000111110101001001111101111101111101000110011101000111000001000011111101000101100011011100011100111101000101000110011101000 e8e791e89294e8fa93efbe8ce8e087e8b1b8e7a28ce8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)