To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 咀?麟彫?咀??愛蕩 10011001111100000011111110010111110110011001001010100100001111111001100111110000001111110011111110001000101001001001001110100000 99f03f97d992a43f99f03f3f88a493a0
EUC-JP 咀?麟彫?咀??愛蕩 11010010111100100011111111001110110110111100010010100110001111111101001011110010001111110011111110110000101001101100011010100010 d2f23fcedbc4a63fd2f23f3fb0a6c6a2
UTF-8 咀쾅麟彫렪咀쾀뤇愛蕩 111001011001001010000000111011001011111010000101111010011011101010011111111001011011110110101011111010111010000010101010111001011001001010000000111011001011111010000000111010111010010010000111111001101000010010011011111010001001010110101001 e59280ecbe85e9ba9fe5bdabeba0aae59280ecbe80eba487e6849be895a9
UHC 咀쾅麟彫렪咀쾀뤇愛蕩 1110111010111010110001001110011111010111111110001111000011000001100011101011100011101110101110101100010011100110100011111011011111100100111100011111011110111001 eebac4e7d7f8f0c18eb8eebac4e68fb7e4f1f7b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)