To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????P???????????? 001111110011111100111111001111110011111100111111001111110011111101010000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堤???沚基??P堤???沚基??種?義? 1001001011100111001111110011111100111111100111111000110110001010111011100011111100111111010100001001001011100111001111110011111100111111100111111000110110001010111011100011111100111111100011101110110100111111100010110110000000111111 92e73f3f3f9f8d8aee3f3f5092e73f3f3f9f8d8aee3f3f8eed3f8b603f
EUC-JP 堤???沚基??P堤???沚基??種?義? 1100010011101001001111110011111100111111110111011110110110110100111100000011111100111111010100001100010011101001001111110011111100111111110111011110110110110100111100000011111100111111101111001110111100111111101101011100000100111111 c4e93f3f3fddedb4f03f3f50c4e93f3f3fddedb4f03f3fbcef3fb5c13f
UTF-8 堤비렰렑沚基렰렖P堤비렰렑沚基렰렖種렟義꿰 11100101101000001010010011101011101110011000010011101011101000001011000011101011101000001001000111100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001011001010000111001011010000010100100111010111011100110000100111010111010000010110000111010111010000010010001111001101011001010011010111001011001111110111010111010111010000010110000111010111010000010010110111001111010100010101110111010111010000010011111111001111011111010101001111010101011111110110000 e5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba09650e5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba096e7a8aeeba09fe7bea9eabfb0
UHC 堤비렰렑沚基렰렖P堤비렰렑沚基렰렖種렟義꿰 1111000010100111101110101111000110001110101111011000111010100110111100101010111111010000111100011000111010111101100011101010101101010000111100001010011110111010111100011000111010111101100011101010011011110010101011111101000011110001100011101011110110001110101010111111000011111010100011101011000011101011111110011011001011100111 f0a7baf18ebd8ea6f2afd0f18ebd8eab50f0a7baf18ebd8ea6f2afd0f18ebd8eabf0fa8eb0ebf9b2e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)