Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雅??衍????????語????????厭 1000100111101011001111110011111110011111101001010011111100111111001111110011111100111111001111110011111100111111100011001110101000111111001111110011111100111111001111110011111100111111001111111000100101111101 89eb3f3f9fa53f3f3f3f3f3f3f3f8cea3f3f3f3f3f3f3f3f897d
EUC-JP 雅??衍????????語????????厭 1011001011101101001111110011111111011110101001110011111100111111001111110011111100111111001111110011111100111111101110001110110000111111001111110011111100111111001111110011111100111111001111111011000111011110 b2ed3f3fdea73f3f3f3f3f3f3f3fb8ec3f3f3f3f3f3f3f3fb1de
UTF-8 雅먮젙衍뚨쭅溜졿짂溜잙젵語ⓨ날溜싦펯溜잙젵厭 111010011001101110000101111010111010100010101110111011001010000010011001111010001010000110001101111010111001101010101000111011001010110110000101111011111010011110001011111011001010000110111111111011001010011110000010111011111010011110001011111011001001111010011001111011001010000010110101111010001010101010011110111000101001001110101000111010111000001010100000111011111010011110001011111011001000101110100110111011011000111010101111111011111010011110001011111011001001111010011001111011001010000010110101111001011000111010101101 e99b85eba8aeeca099e8a18deb9aa8ecad85efa78beca1bfeca782efa78bec9e99eca0b5e8aa9ee293a8eb82a0efa78bec8ba6ed8eafefa78bec9e99eca0b5e58ead
UHC 雅먮젙衍뚨쭅溜졿짂溜잙젵語ⓨ날溜싦펯溜잙젵厭 1110010010111010100100001110101110100000100101011110011011100010100011001110011110100111100000011110101011111110101000001110011010100011100100101110101011111110100111111110101110100000101010011110010111011110101010001110010110110011101011111110101011111110100110101110010010111100100000011110101011111110100111111110101110100000101010011110011011110100 e4ba90eba095e6e28ce7a781eafea0e6a392eafe9feba0a9e5dea8e5b3afeafe9ae4bc81eafe9feba0a9e6f4
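The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the helper names `to_bit_strings`, `convert`, and `CODECS` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption that may differ from the original tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso8859_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("雅").items():
        print(label, binary, hexadecimal)
```

For the single character 雅, this yields 89eb for `cp932`, b2ed for `euc_jp`, and e99b85 for `utf_8`, the same leading bytes as the SJIS-WIN, EUC-JP, and UTF-8 rows in the table.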

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
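
As a rough illustration of how the two Japanese charsets relate, the sketch below decodes byte pairs taken from the SJIS-WIN and EUC-JP rows of the table and checks that each pair maps to the same character. The helper name `decode_hex` is hypothetical, and treating SJIS-Win as Python's `cp932` codec is an assumption.

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string (such as '89eb') using the given codec."""
    return bytes.fromhex(hex_string).decode(codec)


# (SJIS-WIN bytes, EUC-JP bytes) pairs copied from the table above.
PAIRS = [("89eb", "b2ed"), ("9fa5", "dea7"), ("8cea", "b8ec"), ("897d", "b1de")]

if __name__ == "__main__":
    for sjis_hex, euc_hex in PAIRS:
        sjis_char = decode_hex(sjis_hex, "cp932")
        euc_char = decode_hex(euc_hex, "euc_jp")
        # Both byte sequences should name the same character.
        print(sjis_hex, euc_hex, sjis_char, sjis_char == euc_char)
```

The byte values differ between the two charsets, but both are two-byte encodings of the same underlying JIS character table, so each pair decodes to one character.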