To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 絶??絶??艶??臟??艶o??①?艶??^ 100100001110001000111111001111111001000011100010001111110011111110001001100100000011111100111111111001000110011000111111001111111000100110010000100000101000111100111111001111111000011101000000001111111000100110010000001111110011111101011110 90e23f3f90e23f3f89903f3fe4663f3f8990828f3f3f87403f89903f3f5e
EUC-JP 絶??絶??艶??臟??艶o????艶??^ 1100000011100100001111110011111111000000111001000011111100111111101100011111000000111111001111111110011111000111001111110011111110110001111100001010001111101111001111110011111100111111001111111011000111110000001111110011111101011110 c0e43f3fc0e43f3fb1f03f3fe7c73f3fb1f0a3ef3f3f3f3fb1f03f3f5e
UTF-8 絶쀧윢絶뷁맖艶쀧윂臟놅풓艶o풐狀①윐艶랂뻻^ 11100111101101011011011011101100100000001010011111101100100111001010001011100111101101011011011011101011101101111000000111101011101001111001011011101000100010011011011011101100100000001010011111101100100111001000001011101000100001111001111111101011100001101000010111101101100100101001001111101000100010011011011011101111101111011000111111101101100100101001000011101111101001111011101011100010100100011010000011101100100111001001000011101000100010011011011011101011100111101000001011101011101110111011101101011110 e7b5b6ec80a7ec9ca2e7b5b6ebb781eba796e889b6ec80a7ec9c82e8879feb8685ed9293e889b6efbd8fed9290efa7bae291a0ec9c90e889b6eb9e82ebbbbb5e
UHC 絶쀧윢絶뷁맖艶쀧윂臟놅풓艶o풐狀①윐艶랂뻻^ 11101111101111101001011111100111100111111010001111101111101111101001010011101110100100001010100011100110111111011001011111100111100111111000110111101101111101001000011011101111101111101001011111100110111111011010001111101111101111101001010011101101111011101010100011100111100111111001011111100110111111011000110111101110100101101000011001011110 efbe97e79fa3efbe94ee90a8e6fd97e79f8dedf486efbe97e6fda3efbe94edeea8e79f97e6fd8dee96865e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
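
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the mapping from the table's charset labels to Python codec names (CHARSETS) and the helper names encode_row and convert are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above equates SJIS-Win with CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset, keyed by the table's label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\u7d76" is the kanji that begins the SJIS-WIN, EUC-JP and UTF-8 rows.
    for label, (binary, hexadecimal) in convert("\u7d76^").items():
        print(label, binary, hexadecimal)
```

For the two-character input used here, the UTF-8, SJIS-WIN and EUC-JP results begin with e7b5b6, 90e2 and c0e4 respectively, the same leading bytes as the corresponding rows of the table, and every result ends with 5e for "^".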