To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 繹?????仰??v繹?????仰??vB 11100011100010000011111100111111001111110011111100111111100010111100001000111111001111110111011011100011100010000011111100111111001111110011111100111111100010111100001000111111001111110111011001000010 e3883f3f3f3f3f8bc23f3f76e3883f3f3f3f3f8bc23f3f7642
EUC-JP 繹?????仰??v繹?????仰??vB 11100101111010000011111100111111001111110011111100111111101101101100010000111111001111110111011011100101111010000011111100111111001111110011111100111111101101101100010000111111001111110111011001000010 e5e83f3f3f3f3fb6c43f3f76e5e83f3f3f3f3fb6c43f3f7642
UTF-8 繹먮젾惡욌젳仰뜻땼v繹먮젾惡욌젳仰뜻땼vB 111001111011100110111001111010111010100010101110111011001010000010111110111011111010011010111001111011001001101010001100111011001010000010110011111001001011101110110000111010111001110010111011111010111001010110111100011101101110011110111001101110011110101110101000101011101110110010100000101111101110111110100110101110011110110010011010100011001110110010100000101100111110010010111011101100001110101110011100101110111110101110010101101111000111011001000010 e7b9b9eba8aeeca0beefa6b9ec9a8ceca0b3e4bbb0eb9cbbeb95bc76e7b9b9eba8aeeca0beefa6b9ec9a8ceca0b3e4bbb0eb9cbbeb95bc7642
UHC 繹먮젾惡욌젳仰뜻땼v繹먮젾惡욌젳仰뜻땼vB 111001101011101010010000111010111010000010110000111001111111011110011110111010111010000010100111111001001110011010110110111001101000101110010010011101101110011010111010100100001110101110100000101100001110011111110111100111101110101110100000101001111110010011100110101101101110011010001011100100100111011001000010 e6ba90eba0b0e7f79eeba0a7e4e6b6e68b9276e6ba90eba0b0e7f79eeba0a7e4e6b6e68b927642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)