To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????LB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c42
SJIS-WIN ?????????????????????LB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c42
EUC-JP ?????????????????????LB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c42
UTF-8 챌쨉쨋챗쨩혖챗짼혝챕징혥챙혝징챘혣혨챙혞짼LB 1110110010110001100011001110110010101000100010011110110010101000100010111110110010110001100101111110110010101000101010011110110110011000100101101110110010110001100101111110110010100111101111001110110110011000100111011110110010110001100101011110110010100111100101011110110110011000101001011110110010110001100110011110110110011000100111011110110010100111100101011110110010110001100110001110110110011000101000111110110110011000101010001110110010110001100110011110110110011000100111101110110010100111101111000100110001000010 ecb18ceca889eca88becb197eca8a9ed9896ecb197eca7bced989decb195eca795ed98a5ecb199ed989deca795ecb198ed98a3ed98a8ecb199ed989eeca7bc4c42
UHC 챌쨉쨋챗쨩혖챗짼혝챕징혥챙혝징챘혣혨챙혞짼LB 1100001110100111110000101011010111000010101101101100001110101010110000101011101111000010100000011100001110101010110000101011001011000010100001111100001110101001110000101010000111000010100011011100001110101100110000101000011111000010101000011100001110101011110000101000110011000010100100001100001110101100110000101000100011000010101100100100110001000010 c3a7c2b5c2b6c3aac2bbc281c3aac2b2c287c3a9c2a1c28dc3acc287c2a1c3abc28cc290c3acc288c2b24c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)