To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????H???????????? 00111111001111110011111100111111001111110011111100111111001111110011111101001000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f483f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????H???????????? 00111111001111110011111100111111001111110011111100111111001111110011111101001000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f483f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????H???????????? 00111111001111110011111100111111001111110011111100111111001111110011111101001000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f483f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챌챦혟챘혶혮챘첸쨘H챔첫혚챘짭혡챙짯혚챦짠쨉 11101100101100011000110011101100101100011010011011101101100110001001111111101100101100011001100011101101100110001011011011101101100110001010111011101100101100011001100011101100101100101011100011101100101010001001100001001000111011001011000110010100111011001011001010101011111011011001100010011010111011001011000110011000111011001010011110101101111011011001100010100001111011001011000110011001111011001010011110101111111011011001100010011010111011001011000110100110111011001010011110100000111011001010100010001001 ecb18cecb1a6ed989fecb198ed98b6ed98aeecb198ecb2b8eca89848ecb194ecb2abed989aecb198eca7aded98a1ecb199eca7afed989aecb1a6eca7a0eca889
UHC 챌챦혟챘혶혮챘첸쨘H챔첫혚챘짭혡챙짯혚챦짠쨉 11000011101001111100001110101111110000101000100111000011101010111100001010011101110000101001010111000011101010111100001110111110110000101011101001001000110000111010100011000011101110011100001010000101110000111010101111000010101011001100001010001010110000111010110011000010101011011100001010000101110000111010111111000010101001111100001010110101 c3a7c3afc289c3abc29dc295c3abc3bec2ba48c3a8c3b9c285c3abc2acc28ac3acc2adc285c3afc2a7c2b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)