Which bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 業??絶??臟わ?^ 1000101111000110001111110011111110010000111000100011111100111111111001000110011010000010111011010011111101011110 8bc63f3f90e23f3fe46682ed3f5e
EUC-JP 業??絶??臟わ?^ 1011011011001000001111110011111111000000111001000011111100111111111001111100011110100100111011110011111101011110 b6c83f3fc0e43f3fe7c7a4ef3f5e
UTF-8 業뉚뼧絶롦삀臟わ풆^ 11100110101001011010110111101011100010011001101011101011101111001010011111100111101101011011011011101011101000011010011011101100100000101000000011101000100001111001111111100011100000101000111111101101100100101000011001011110 e6a5adeb899aebbca7e7b5b6eba1a6ec8280e8879fe3828fed92865e
UHC 業뉚뼧絶롦삀臟わ풆^ 11100101111101101000011111101110100101101010101011101111101111101000111011100110100110001000011111101101111101001010101011101111101111101000111001011110 e5f687ee96aaefbe8ee69887edf4aaefbe8e5e
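
The rows above can be reproduced approximately with a short Python sketch. It assumes that the page's charset labels correspond to the Python codecs named in `CODECS` (that mapping, and the function names, are illustrative and not taken from the page), and that a character the charset cannot represent is replaced by `?` (byte 3f), which is what the ISO-8859-1, SJIS-WIN and EUC-JP rows show.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# The labels come from the table; the codec choices are a best guess.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: to_bitstrings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("わ^").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal one.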

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.