Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
UTF-8 책쩍짝챘혳짢챙쨍혮책짜 111011001011000110000101111011001010100110001101111011001010011110011101111011001011000110011000111011011001100010110011111011001010011110100010111011001011000110011001111011001010100010001101111011011001100010101110111011001011000110000101111011001010011110011100 ecb185eca98deca79decb198ed98b3eca7a2ecb199eca88ded98aeecb185eca79c
UHC 책쩍짝챘혳짢챙쨍혮책짜 11000011101001011100001010111101110000101010011011000011101010111100001010011010110000101010100011000011101011001100001010111000110000101001010111000011101001011100001010100101 c3a5c2bdc2a6c3abc29ac2a8c3acc2b8c295c3a5c2a5

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
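
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to correspond to the charset labels above, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as stated in the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" emits "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, bit string, hex string.
    for label, (bits, hexstr) in convert("\ucc45").items():
        print(label, bits, hexstr)
```

Calling `convert` on a single Hangul syllable yields a three-byte sequence for UTF-8 and a single 3f byte for the charsets that have no Korean repertoire.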