Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 챤짯혦짧챔쩍횑챌혦짹횥챌혦쨌혦쩍챘첵횧챔쨔챕B 11101100101100011010010011101100101001111010111111101101100110001010011011101100101001111010011111101100101100011001010011101100101010011000110111101101100110101001000111101100101100011000110011101101100110001010011011101100101001111011100111101101100110101010010111101100101100011000110011101101100110001010011011101100101010001000110011101101100110001010011011101100101010011000110111101100101100011001100011101100101100101011010111101101100110101010011111101100101100011001010011101100101010001001010011101100101100011001010101000010 ecb1a4eca7afed98a6eca7a7ecb194eca98ded9a91ecb18ced98a6eca7b9ed9aa5ecb18ced98a6eca88ced98a6eca98decb198ecb2b5ed9aa7ecb194eca894ecb19542
UHC 챤짯혦짧챔쩍횑챌혦짹횥챌혦쨌혦쩍챘첵횧챔쨔챕B 110000111010111011000010101011011100001010001110110000101010101011000011101010001100001010111101110000111000110011000011101001111100001010001110110000101011000111000011100111001100001110100111110000101000111011000010101101111100001010001110110000101011110111000011101010111100001110111101110000111001111011000011101010001100001010111001110000111010100101000010 c3aec2adc28ec2aac3a8c2bdc38cc3a7c28ec2b1c39cc3a7c28ec2b7c28ec2bdc3abc3bdc39ec3a8c2b9c3a942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
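
The conversion shown in the table can be approximated in Python. This is a minimal sketch, not the page's actual implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are Python's own, and their mapping to the labels above is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the rows of question marks above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For example, `encode_row("B", "utf-8")` returns `("01000010", "42")`, matching the trailing `B` in every row of the table.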