To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 臟??章??蘂①?n}臟??章??蘂①?n{^ 11100100011001100011111100111111100011111100110100111111001111111110010101000001100001110100000000111111011011100111110111100100011001100011111100111111100011111100110100111111001111111110010101000001100001110100000000111111011011100111101101011110 e4663f3f8fcd3f3fe54187403f6e7de4663f3f8fcd3f3fe54187403f6e7b5e
EUC-JP 臟??章??蘂??n}臟??章??蘂??n{^ 1110011111000111001111110011111110111110110011110011111100111111111010011010001000111111001111110110111001111101111001111100011100111111001111111011111011001111001111110011111111101001101000100011111100111111011011100111101101011110 e7c73f3fbecf3f3fe9a23f3f6e7de7c73f3fbecf3f3fe9a23f3f6e7b5e
UTF-8 臟륃궙章든윐蘂①왃n}臟륃궙章든윐蘂①왃n{^ 1110100010000111100111111110101110100101100000111110101010110110100110011110011110101011101000001110101110010011101000001110110010011100100100001110100010011000100000101110001010010001101000001110110010011001100000110110111001111101111010001000011110011111111010111010010110000011111010101011011010011001111001111010101110100000111010111001001110100000111011001001110010010000111010001001100010000010111000101001000110100000111011001001100110000011011011100111101101011110 e8879feba583eab699e7aba0eb93a0ec9c90e89882e291a0ec99836e7de8879feba583eab699e7aba0eb93a0ec9c90e89882e291a0ec99836e7b5e
UHC 臟륃궙章든윐蘂①왃n}臟륃궙章든윐蘂①왃n{^ 1110110111110100100011111110111010000010101011101110110111110001101101011110011110011111100101111110011111011110101010001110011110011110101101100110111001111101111011011111010010001111111011101000001010101110111011011111000110110101111001111001111110010111111001111101111010101000111001111001111010110110011011100111101101011110 edf48fee82aeedf1b5e79f97e7dea8e79eb66e7dedf48fee82aeedf1b5e79f97e7dea8e79eb66e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)