Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챨쨔쨋챰혦쩍혦쨩챨첨혦짖챗짜혦쩍혧횉체횞짙 111011001011000110101000111011001010100010010100111011001010100010001011111011001011000110110000111011011001100010100110111011001010100110001101111011011001100010100110111011001010100010101001111011001011000110101000111011001011001010101000111011011001100010100110111011001010011110010110111011001011000110010111111011001010011110011100111011011001100010100110111011001010100110001101111011011001100010100111111011011001101010001001111011001011001010110100111011011001101010011110111011001010011110011001 ecb1a8eca894eca88becb1b0ed98a6eca98ded98a6eca8a9ecb1a8ecb2a8ed98a6eca796ecb197eca79ced98a6eca98ded98a7ed9a89ecb2b4ed9a9eeca799
UHC 챨쨔쨋챰혦쩍혦쨩챨첨혦짖챗짜혦쩍혧횉체횞짙 110000111011000011000010101110011100001010110110110000111011000111000010100011101100001010111101110000101000111011000010101110111100001110110000110000111011011111000010100011101100001010100010110000111010101011000010101001011100001010001110110000101011110111000010100011111100001110000111110000111011110011000011100101111100001010100011 c3b0c2b9c2b6c3b1c28ec2bdc28ec2bbc3b0c3b7c28ec2a2c3aac2a5c28ec2bdc28fc387c3bcc397c2a3

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
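
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels above, and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the rows of 3f bytes in the table.

```python
# Python codec names assumed to correspond to the charset labels in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text in the given codec."""
    # errors="replace" turns unrepresentable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "A" is plain ASCII, so every charset yields 01000001 / 41.
    for label, (bits, hexstr) in convert("A").items():
        print(label, bits, hexstr)
```

With this sketch, `encode_row("챨", "utf-8")` returns the same first three bytes (`ecb1a8`) as the UTF-8 row above, while `encode_row("챨", "iso-8859-1")` returns `3f` because ISO-8859-1 has no Hangul.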