To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 鶯??泣??惟??D鶯??泣??惟??D^ 111010011111001000111111001111111000101110000011001111110011111110001000110100100011111100111111010001001110100111110010001111110011111110001011100000110011111100111111100010001101001000111111001111110100010001011110 e9f23f3f8b833f3f88d23f3f44e9f23f3f8b833f3f88d23f3f445e
EUC-JP 鶯??泣??惟??D鶯??泣??惟??D^ 111100101111010000111111001111111011010111100011001111110011111110110000110101000011111100111111010001001111001011110100001111110011111110110101111000110011111100111111101100001101010000111111001111110100010001011110 f2f43f3fb5e33f3fb0d43f3f44f2f43f3fb5e33f3fb0d43f3f445e
UTF-8 鶯낃퉵泣ⓨ쮦惟듭뒴D鶯낃퉵泣ⓨ쮦惟듭뒴D^ 111010011011011010101111111010111000001010000011111011011000100110110101111001101011001110100011111000101001001110101000111011001010111010100110111001101000001110011111111010111001001110101101111010111001001010110100010001001110100110110110101011111110101110000010100000111110110110001001101101011110011010110011101000111110001010010011101010001110110010101110101001101110011010000011100111111110101110010011101011011110101110010010101101000100010001011110 e9b6afeb8283ed89b5e6b3a3e293a8ecaea6e6839feb93adeb92b444e9b6afeb8283ed89b5e6b3a3e293a8ecaea6e6839feb93adeb92b4445e
UHC 鶯낃퉵泣ⓨ쮦惟듭뒴D鶯낃퉵泣ⓨ쮦惟듭뒴D^ 111001011010001110000101111010101011100110001101111010111110100010101000111001011010100010000011111010101110111010110101111011001000101010101101010001001110010110100011100001011110101010111001100011011110101111101000101010001110010110101000100000111110101011101110101101011110110010001010101011010100010001011110 e5a385eab98debe8a8e5a883eaeeb5ec8aad44e5a385eab98debe8a8e5a883eaeeb5ec8aad445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)