To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????P?????????D??? 0011111100111111001111110011111100111111001111110011111100111111001111110101000000111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111 3f3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f3f3f443f3f3f
SJIS-WIN ?????????P?????????D??? 0011111100111111001111110011111100111111001111110011111100111111001111110101000000111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111 3f3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f3f3f443f3f3f
EUC-JP ?????????P?????????D??? 0011111100111111001111110011111100111111001111110011111100111111001111110101000000111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111 3f3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f3f3f443f3f3f
UTF-8 챔혪쨩챘짧혣챙혞짱P챔혪쨩챘짧혣챙혟짜D챔혪쨩 1110110010110001100101001110110110011000101010101110110010101000101010011110110010110001100110001110110010100111101001111110110110011000101000111110110010110001100110011110110110011000100111101110110010100111101100010101000011101100101100011001010011101101100110001010101011101100101010001010100111101100101100011001100011101100101001111010011111101101100110001010001111101100101100011001100111101101100110001001111111101100101001111001110001000100111011001011000110010100111011011001100010101010111011001010100010101001 ecb194ed98aaeca8a9ecb198eca7a7ed98a3ecb199ed989eeca7b150ecb194ed98aaeca8a9ecb198eca7a7ed98a3ecb199ed989feca79c44ecb194ed98aaeca8a9
UHC 챔혪쨩챘짧혣챙혞짱P챔혪쨩챘짧혣챙혟짜D챔혪쨩 1100001110101000110000101001001011000010101110111100001110101011110000101010101011000010100011001100001110101100110000101000100011000010101011110101000011000011101010001100001010010010110000101011101111000011101010111100001010101010110000101000110011000011101011001100001010001001110000101010010101000100110000111010100011000010100100101100001010111011 c3a8c292c2bbc3abc2aac28cc3acc288c2af50c3a8c292c2bbc3abc2aac28cc3acc289c2a544c3a8c292c2bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)