Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????P 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f50
SJIS-WIN ?????????????????????P 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f50
EUC-JP ?????????????????????P 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f50
UTF-8 챵짙쩐챰혦쩍혦짹챤쨩혦짠챔쨘혦짼혧횧횎챙쩌P 11101100101100011011010111101100101001111001100111101100101010011001000011101100101100011011000011101101100110001010011011101100101010011000110111101101100110001010011011101100101001111011100111101100101100011010010011101100101010001010100111101101100110001010011011101100101001111010000011101100101100011001010011101100101010001001100011101101100110001010011011101100101001111011110011101101100110001010011111101101100110101010011111101101100110101000111011101100101100011001100111101100101010011000110001010000 ecb1b5eca799eca990ecb1b0ed98a6eca98ded98a6eca7b9ecb1a4eca8a9ed98a6eca7a0ecb194eca898ed98a6eca7bced98a7ed9aa7ed9a8eecb199eca98c50
UHC 챵짙쩐챰혦쩍혦짹챤쨩혦짠챔쨘혦짼혧횧횎챙쩌P 11000011101100101100001010100011110000101011111011000011101100011100001010001110110000101011110111000010100011101100001010110001110000111010111011000010101110111100001010001110110000101010011111000011101010001100001010111010110000101000111011000010101100101100001010001111110000111001111011000011100010101100001110101100110000101011110001010000 c3b2c2a3c2bec3b1c28ec2bdc28ec2b1c3aec2bbc28ec2a7c3a8c2bac28ec2b2c28fc39ec38ac3acc2bc50
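
The rows above can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation (which is not shown). The mapping from the page's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1, SJIS-WIN, and EUC-JP rows show.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# The page's own implementation is unknown; these are the closest standard codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あP").items():
        print(label, bits, hexstr)
```

For example, `encode_row("あ", "utf-8")` yields the hex string `e38182`, while `encode_row("あ", "cp932")` yields `82a0`: the same character maps to different byte sequences depending on the charset.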

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).