To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瘟??押??耶ュ?瘟??秧??蘖??娃 111000011000100100111111001111111000100110011111001111110011111110010110111010111000001110000101001111111110000110001001001111110011111111100010010111100011111100111111100111110101000000111111001111111000100010100001 e1893f3f899f3f3f96eb83853fe1893f3fe25e3f3f9f503f3f88a1
EUC-JP 瘟??押??耶ュ?瘟??秧??蘖??娃 111000011110100100111111001111111011001010100001001111110011111111001100111011011010010111100101001111111110000111101001001111110011111111100011101111110011111100111111110111011011000100111111001111111011000010100011 e1e93f3fb2a13f3fcceda5e53fe1e93f3fe3bf3f3fddb13f3fb0a3
UTF-8 瘟룩큹押뜻짎耶ュ콪瘟룩큹秧믥떥蘖띶겢娃 111001111001100010011111111010111010001110101001111011011000000110111001111001101000101010111100111010111001110010111011111011001010011110001110111010001000000010110110111000111000001110100101111011001011110110101010111001111001100010011111111010111010001110101001111011011000000110111001111001111010011110100111111010111010111110100101111010111001011010100101111010001001100010010110111010111001110110110110111010101011001010100010111001011010100010000011 e7989feba3a9ed81b9e68abceb9cbbeca78ee880b6e383a5ecbdaae7989feba3a9ed81b9e7a7a7ebafa5eb96a5e89896eb9db6eab2a2e5a883
UHC 瘟룩큹押뜻짎耶ュ콪瘟룩큹秧믥떥蘖띶겢娃 1110100010110000101101111110100010110100100010001110010011100011101101101110011010100011100110101110010110101101101010111110010110110001100111101110100010110000101101111110100010110100100010001110010011101011100100101110011110001011101110001110010111101110100011011110010110000001101101001110100011011111 e8b0b7e8b488e4e3b6e6a39ae5adabe5b19ee8b0b7e8b488e4eb92e78bb8e5ee8de581b4e8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)