To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??淫??艶k?循??蟻??筌 111000011001111100111111001111111000101101011000001111110011111110001000111110100011111100111111100010011001000010000010100010110011111110001111011110100011111100111111100010110110000100111111001111111110001010100011 e19f3f3f8b583f3f88fa3f3f8990828b3f8f7a3f3f8b613f3fe2a3
EUC-JP 癲??宜??淫??艶k?循??蟻??筌 111000101010000100111111001111111011010110111001001111110011111110110000111111000011111100111111101100011111000010100011111010110011111110111101110110110011111100111111101101011100001000111111001111111110010010100101 e2a13f3fb5b93f3fb0fc3f3fb1f0a3eb3fbddb3f3fb5c23f3fe4a5
UTF-8 癲덈챶宜뤄쭒淫뚯녃艶k챷循들쉬蟻녾덱筌 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111010010010000100111011001010110110010010111001101011011110101011111010111001101010101111111010111000010110000011111010001000100110110110111011111011110110001011111011001011000110110111111001011011111010101010111010111001001110100100111011001000100110101100111010001001111110111011111010111000010110111110111010111000110110110001111001111010110110001100 e799b2eb8d88ecb1b6e5ae9ceba484ecad92e6b7abeb9aafeb8583e889b6efbd8becb1b7e5beaaeb93a4ec89ace89fbbeb85beeb8db1e7ad8c
UHC 癲덈챶宜뤄쭒淫뚯녃艶k챷循들쉬蟻녾덱筌 1110111110100110100010001110101110101010100000111110101111110001101101111110111110100111100010101110101111100010100011001110110010000110101110111110011011111101101000111110101110101010100001001110001011100000101101011110100110111101101011001110101111111100100001101110101010110101101001101110111110100111 efa688ebaa83ebf1b7efa78aebe28cec86bbe6fda3ebaa84e2e0b5e9bdacebfc86eab5a6efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)