To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????~B 0011111100111111001111110011111100111111001111110011111100111111001111110111111001000010 3f3f3f3f3f3f3f3f3f7e42
SJIS-WIN ???爾??攸??~B 00111111001111110011111110001110101000100011111100111111100111011011111100111111001111110111111001000010 3f3f3f8ea23f3f9dbf3f3f7e42
EUC-JP ???爾??攸??~B 00111111001111110011111110111100101001000011111100111111110110101100000100111111001111110111111001000010 3f3f3fbca43f3fdac13f3f7e42
UTF-8 銳얜갭爾닺맱攸낆낯~B 1110100110001010101100111110110010010110100111001110101010110000101011011110011110001000101111101110101110001011101110101110101110100111101100011110011010010100101110001110101110000010100001101110101110000010101011110111111001000010 e98ab3ec969ceab0ade788beeb8bbaeba7b1e694b8eb8286eb82af7e42
UHC 銳얜갭爾닺맱攸낆낯~B 1110011111100101101111101110101110110000101110001110110010110011101101001110100010010000101110001110101011110010100001011110110010110011101110000111111001000010 e7e5beebb0b8ecb3b4e890b8eaf285ecb3b87e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)