What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??攸?????濡λ?瑤??吟?? 1110000110011111001111111000001001010111111001101111101000111111001111111001110110111111001111110011111100111111001111110011111110010100010001111000001111001001001111111110101010100010001111110011111110001011111000010011111100111111 e19f3f8257e6fa3f3f9dbf3f3f3f3f3f944783c93feaa23f3f8be13f3f
EUC-JP 癲?8踰??攸??轝??濡λ?瑤??吟?? 11100010101000010011111110100011101110001110110011111100001111110011111111011010110000010011111100111111100011111110000110101010001111110011111111000111101010001010011011001011001111111111010010100100001111110011111110110110111000110011111100111111 e2a13fa3b8ecfc3f3fdac13f3f8fe1aa3f3fc7a8a6cb3ff4a43f3fb6e33f3f
UTF-8 癲쒕8踰딂굢攸곸냸轝얏쾮濡λ젞瑤노툖吟뤄쬅 1110011110011001101100101110110010010010100101011110111110111100100110001110100010111000101100001110101110010100100000101110101010110101101000101110011010010100101110001110101010110011101110001110101110000011101110001110100010111101100111011110110010010110100011111110110010111110101011101110011010111111101000011100111010111011111011001010000010011110111001111001000110100100111010111000010110111000111011011000100010010110111001011001000010011111111010111010010010000100111011001010110010000101 e799b2ec9295efbc98e8b8b0eb9482eab5a2e694b8eab3b8eb83b8e8bd9dec968fecbeaee6bfa1cebbeca09ee791a4eb85b8ed8896e5909feba484ecac85
UHC 癲쒕8踰딂굢攸곸냸轝얏쾮濡λ젞瑤노툖吟뤄쬅 111011111010011010011100111010111010001110111000111010111011001010001010111010001000001010001001111010101111001010000001111011001000011010001000111001101010110010111110111001101011001010000101111010111010000110100101111010111010000010011000111010001111110110110011111010111011100010001101111010111110000110110111111011111010011010011100 efa69ceba3b8ebb28ae88289eaf281ec8688e6acbee6b285eba1a5eba098e8fdb3ebb88debe1b7efa69c

SJIS-Win, EUC-JP: Classic character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
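
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the function name `bit_strings` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which resembles the runs of 3f in the table above.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {label: (binary string, hex string)} for each charset."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        result[label] = (binary, data.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexa) in bit_strings("あ").items():
        print(label, binary, hexa)
```

For a character such as "あ", this shows a different byte sequence per charset, and a single "?" byte for ISO-8859-1, which has no Japanese characters.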