What bit string is a character (or a short string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??倚ч????汚??誼?┐ 111000011001111100111111001111111001100011011111100001001000100100111111001111110011111100111111100010011001100000111111001111111000101101100010001111111000010010100010 e19f3f3f98df84893f3f3f3f89983f3f8b623f84a2
EUC-JP 癲??倚ч????汚??誼?┐ 111000101010000100111111001111111101000011100001101001111110100100111111001111110011111100111111101100011111100000111111001111111011010111000011001111111010100010100100 e2a13f3fd0e1a7e93f3f3f3fb1f83f3fb5c33fa8a4
UTF-8 癲ㅺ퀗倚ч씣戮녹춷汚살늿誼⑼┐ 1110011110011001101100101110001110000101101110101110110110000000100101111110010110000000100110101101000110000111111011001001010010100011111011111010011110010010111010111000010110111001111011001011011010110111111001101011000110011010111011001000001010110100111010111000101010111111111010001010101010111100111000101001000110111100111000101001010010010000 e799b2e385baed8097e5809ad187ec94a3efa792eb85b9ecb6b7e6b19aec82b4eb8abfe8aabce291bce29490
UHC 癲ㅺ퀗倚ч씣戮녹춷汚살늿誼⑼┐ 111011111010011010100100111010101011001110001100111010111110111110101100111010011001110110110111111010111011110110110011111011001010110110010011111001111111110110111011111011001000100010001000111010111111111010101001111011111010011010100100 efa6a4eab38cebeface99db7ebbdb3ecad93e7fdbbec8888ebfea9efa6a4

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
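
A conversion like the one in the table above can be sketched in Python. This is only an illustration under stated assumptions, not the code behind this page: the function name `encode_report` is hypothetical, and the Python codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to correspond to the charsets listed. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` entries in the table.

```python
# Minimal sketch: encode a string in several charsets and show the resulting
# bytes as binary and hexadecimal strings. The function name and the codec
# mapping are assumptions made for this example.

CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed Python equivalent of SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed Python equivalent of UHC
}


def encode_report(text):
    """Return a list of (charset, shown_text, binary, hex) tuples."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" turns unencodable characters into "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        # Decode again to show what survives a round trip in this charset.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, binary, hexa in encode_report("ч"):
        print(label, shown, binary, hexa)
```

For the Cyrillic letter "ч", which appears in the sample input above, this sketch should report the same byte values as the corresponding positions in the table (`8489` for SJIS-WIN, `a7e9` for EUC-JP, `d187` for UTF-8) and `3f` for ISO-8859-1.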