To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??攸?????濡λ?瑤??吟 111000011001111100111111100000100101011111100110111110100011111100111111100111011011111100111111001111110011111100111111001111111001010001000111100000111100100100111111111010101010001000111111001111111000101111100001 e19f3f8257e6fa3f3f9dbf3f3f3f3f3f944783c93feaa23f3f8be1
EUC-JP 癲?8踰??攸??轝??濡λ?瑤??吟 1110001010100001001111111010001110111000111011001111110000111111001111111101101011000001001111110011111110001111111000011010101000111111001111111100011110101000101001101100101100111111111101001010010000111111001111111011011011100011 e2a13fa3b8ecfc3f3fdac13f3f8fe1aa3f3fc7a8a6cb3ff4a43f3fb6e3
UTF-8 癲쒕8踰딂굢攸곸냸轝얏쾮濡λ젞瑤노툖吟 1110011110011001101100101110110010010010100101011110111110111100100110001110100010111000101100001110101110010100100000101110101010110101101000101110011010010100101110001110101010110011101110001110101110000011101110001110100010111101100111011110110010010110100011111110110010111110101011101110011010111111101000011100111010111011111011001010000010011110111001111001000110100100111010111000010110111000111011011000100010010110111001011001000010011111 e799b2ec9295efbc98e8b8b0eb9482eab5a2e694b8eab3b8eb83b8e8bd9dec968fecbeaee6bfa1cebbeca09ee791a4eb85b8ed8896e5909f
UHC 癲쒕8踰딂굢攸곸냸轝얏쾮濡λ젞瑤노툖吟 1110111110100110100111001110101110100011101110001110101110110010100010101110100010000010100010011110101011110010100000011110110010000110100010001110011010101100101111101110011010110010100001011110101110100001101001011110101110100000100110001110100011111101101100111110101110111000100011011110101111100001 efa69ceba3b8ebb28ae88289eaf281ec8688e6acbee6b285eba1a5eba098e8fdb3ebb88debe1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)