To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??喩??艶c????諛??額 1110000110011111001111110011111110001000110100110011111100111111100110100110011100111111001111111000100110010000100000101000001100111111001111110011111100111111111001101000011100111111001111111000101001111010 e19f3f3f88d33f3f9a673f3f899082833f3f3f3fe6873f3f8a7a
EUC-JP 癲??意??喩??艶c?彛??諛??額 11100010101000010011111100111111101100001101010100111111001111111101001111001000001111110011111110110001111100001010001111100011001111111000111110111100111110100011111100111111111010111110011100111111001111111011001111011011 e2a13f3fb0d53f3fd3c83f3fb1f0a3e33f8fbcfa3f3febe73f3fb3db
UTF-8 癲⑸뜄意욇윀喩볝걶艶c끉彛뗥넇諛멸턁額 111001111001100110110010111000101001000110111000111010111001110010000100111001101000010010001111111011001001101010000111111011001001110010000000111001011001011010101001111010111011001110011101111010101011000110110110111010001000100110110110111011111011110110000011111010111000000110001001111001011011110110011011111010111001011110100101111010111000010010000111111010001010101110011011111010111010100110111000111011011000010010000001111010011010000110001101 e799b2e291b8eb9c84e6848fec9a87ec9c80e596a9ebb39deab1b6e889b6efbd83eb8189e5bd9beb97a5eb8487e8ab9beba9b8ed8481e9a18d
UHC 癲⑸뜄意욇윀喩볝걶艶c끉彛뗥넇諛멸턁額 1110111110100110101010011110101110001101100010001110101111110010100111101110100110011111100010111110101011100111100100111110001110000001100111001110011011111101101000111110001110000101101111001110110010101101100010111110010110000110100101111110101110110000101110001110101010110101100111011110010011111110 efa6a9eb8d88ebf29ee99f8beae793e3819ce6fda3e385bcecad8be58697ebb0b8eab59de4fe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)