What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????B}v????????B}vB 0011111100111111001111110011111100111111001111110011111100111111010000100111110101110110001111110011111100111111001111110011111100111111001111110011111101000010011111010111011001000010 3f3f3f3f3f3f3f3f427d763f3f3f3f3f3f3f3f427d7642
SJIS-WIN 堤???沚基??B}v堤???沚基??B}vB 1001001011100111001111110011111100111111100111111000110110001010111011100011111100111111010000100111110101110110100100101110011100111111001111110011111110011111100011011000101011101110001111110011111101000010011111010111011001000010 92e73f3f3f9f8d8aee3f3f427d7692e73f3f3f9f8d8aee3f3f427d7642
EUC-JP 堤???沚基??B}v堤???沚基??B}vB 1100010011101001001111110011111100111111110111011110110110110100111100000011111100111111010000100111110101110110110001001110100100111111001111110011111111011101111011011011010011110000001111110011111101000010011111010111011001000010 c4e93f3f3fddedb4f03f3f427d76c4e93f3f3fddedb4f03f3f427d7642
UTF-8 堤비렰렑沚基렰렖B}v堤비렰렑沚基렰렖B}vB 11100101101000001010010011101011101110011000010011101011101000001011000011101011101000001001000111100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001011001000010011111010111011011100101101000001010010011101011101110011000010011101011101000001011000011101011101000001001000111100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001011001000010011111010111011001000010 e5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba096427d76e5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba096427d7642
UHC 堤비렰렑沚基렰렖B}v堤비렰렑沚基렰렖B}vB 111100001010011110111010111100011000111010111101100011101010011011110010101011111101000011110001100011101011110110001110101010110100001001111101011101101111000010100111101110101111000110001110101111011000111010100110111100101010111111010000111100011000111010111101100011101010101101000010011111010111011001000010 f0a7baf18ebd8ea6f2afd0f18ebd8eab427d76f0a7baf18ebd8ea6f2afd0f18ebd8eab427d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
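
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's codec names `cp932` and `cp949` correspond to SJIS-Win and UHC, and that `errors="replace"` stands in for the `?` substitution seen above for characters a charset cannot represent. The function name `encode_report` is hypothetical, and output for unmappable characters may differ from the table.

```python
# Sketch: encode a string in several character sets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names match the charsets listed above
# (cp932 for SJIS-Win, cp949 for UHC).
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples for text."""
    rows = []
    for name in charsets:
        # Characters the charset cannot represent are replaced with "?" (0x3f).
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_report("B}v"):
        print(name, bits, hexed)
```

For the ASCII string `B}v`, every charset listed yields the same three bytes, `427d76`, which matches the tail of each row in the table above.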