To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 鶯??諭ε?油??[鶯??諭ε?油??[^ 1110100111110010001111110011111110010111010000001000001111000011001111111001011011111011001111110011111101011011111010011111001000111111001111111001011101000000100000111100001100111111100101101111101100111111001111110101101101011110 e9f23f3f974083c33f96fb3f3f5be9f23f3f974083c33f96fb3f3f5b5e
EUC-JP 鶯??諭ε?油??[鶯??諭ε?油??[^ 1111001011110100001111110011111111001101101000011010011011000101001111111100110011111101001111110011111101011011111100101111010000111111001111111100110110100001101001101100010100111111110011001111110100111111001111110101101101011110 f2f43f3fcda1a6c53fccfd3f3f5bf2f43f3fcda1a6c53fccfd3f3f5b5e
UTF-8 鶯낅쓧諭ε즳油밸뱥[鶯낅쓧諭ε즳油밸뱥[^ 11101001101101101010111111101011100000101000010111101100100100111010011111101000101010111010110111001110101101011110110010100110101100111110011010110010101110011110101110110000101110001110101110110001101001010101101111101001101101101010111111101011100000101000010111101100100100111010011111101000101010111010110111001110101101011110110010100110101100111110011010110010101110011110101110110000101110001110101110110001101001010101101101011110 e9b6afeb8285ec93a7e8abadceb5eca6b3e6b2b9ebb0b8ebb1a55be9b6afeb8285ec93a7e8abadceb5eca6b3e6b2b9ebb0b8ebb1a55b5e
UHC 鶯낅쓧諭ε즳油밸뱥[鶯낅쓧諭ε즳油밸뱥[^ 111001011010001110000101111010111001110110001000111010111011000110100101111001011010001110000101111010101111101010111001111010111001001110001011010110111110010110100011100001011110101110011101100010001110101110110001101001011110010110100011100001011110101011111010101110011110101110010011100010110101101101011110 e5a385eb9d88ebb1a5e5a385eafab9eb938b5be5a385eb9d88ebb1a5e5a385eafab9eb938b5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)