To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??溢??鷹??嚥〓????諛??筌 1110000110011111001111110011111110001000111011000011111100111111100100011110100100111111001111111001101010001011100000011010110000111111001111110011111100111111111001101000011100111111001111111110001010100011 e19f3f3f88ec3f3f91e93f3f9a8b81ac3f3f3f3fe6873f3fe2a3
EUC-JP 癲??溢??鷹??嚥〓?瑗??諛??筌 11100010101000010011111100111111101100001110111000111111001111111100001011101011001111110011111111010011111010111010001010101110001111111000111111001100110000000011111100111111111010111110011100111111001111111110010010100101 e2a13f3fb0ee3f3fc2eb3f3fd3eba2ae3f8fccc03f3febe73f3fe4a5
UTF-8 癲뚮씛溢€뤃鷹됥렃嚥〓뀈瑗삣넇諛몃㎥筌 111001111001100110110010111010111001101010101110111011001001010010011011111001101011101010100010111000101000001010101100111010111010010010000011111010011011011110111001111010111001000010100101111010111010000010000011111001011001101010100101111000111000000010010011111010111000000010001000111001111001000110010111111011001000001010100011111010111000010010000111111010001010101110011011111010111010101010000011111000111000111010100101111001111010110110001100 e799b2eb9aaeec949be6baa2e282aceba483e9b7b9eb90a5eba083e59aa5e38093eb8088e79197ec82a3eb8487e8ab9bebaa83e38ea5e7ad8c
UHC 癲뚮씛溢€뤃鷹됥렃嚥〓뀈瑗삣넇諛몃㎥筌 1110111110100110100011001110101110011101101100001110110011101110101000101110011010001111101101001110101111101101100010011110001110001110100111011110011010111111101000011110101110000101100001001110101010111100101110111110010110000110100101111110101110110000101110001110101110100111101010011110111110100111 efa68ceb9db0eceea2e68fb4ebed89e38e9de6bfa1eb8584eabcbbe58697ebb0b8eba7a9efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)