Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 堤捧?褐哉堤捧?鞨?鋼堤捧?褐哉堤捧?鞨?鋼^ 100100101110011110010101111110010011111110001010100011001000110111000110100100101110011110010101111110010011111111101000111000000011111110001101011111001001001011100111100101011111100100111111100010101000110010001101110001101001001011100111100101011111100100111111111010001110000000111111100011010111110001011110 92e795f93f8a8c8dc692e795f93fe8e03f8d7c92e795f93f8a8c8dc692e795f93fe8e03f8d7c5e
EUC-JP 堤捧?褐哉堤捧?鞨?鋼堤捧?褐哉堤捧?鞨?鋼^ 110001001110100111001010111110110011111110110011111011001011101011001000110001001110100111001010111110110011111111110000111000100011111110111001110111011100010011101001110010101111101100111111101100111110110010111010110010001100010011101001110010101111101100111111111100001110001000111111101110011101110101011110 c4e9cafb3fb3ecbac8c4e9cafb3ff0e23fb9ddc4e9cafb3fb3ecbac8c4e9cafb3ff0e23fb9dd5e
UTF-8 堤捧렋褐哉堤捧렋鞨렏鋼堤捧렋褐哉堤捧렋鞨렏鋼^ 11100101101000001010010011100110100011011010011111101011101000001000101111101000101001001001000011100101100100111000100111100101101000001010010011100110100011011010011111101011101000001000101111101001100111101010100011101011101000001000111111101001100010111011110011100101101000001010010011100110100011011010011111101011101000001000101111101000101001001001000011100101100100111000100111100101101000001010010011100110100011011010011111101011101000001000101111101001100111101010100011101011101000001000111111101001100010111011110001011110 e5a0a4e68da7eba08be8a490e59389e5a0a4e68da7eba08be99ea8eba08fe98bbce5a0a4e68da7eba08be8a490e59389e5a0a4e68da7eba08be99ea8eba08fe98bbc5e
UHC 堤捧렋褐哉堤捧렋鞨렏鋼堤捧렋褐哉堤捧렋鞨렏鋼^ 111100001010011111011100111010011000111010100010110010101110100011101110101000111111000010100111110111001110100110001110101000101100101011101010100011101010010111001011101111001111000010100111110111001110100110001110101000101100101011101000111011101010001111110000101001111101110011101001100011101010001011001010111010101000111010100101110010111011110001011110 f0a7dce98ea2cae8eea3f0a7dce98ea2caea8ea5cbbcf0a7dce98ea2cae8eea3f0a7dce98ea2caea8ea5cbbc5e
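
The rows above can be reproduced approximately with a short Python sketch. It is not the tool's actual implementation. It assumes that Python's codec names `cp932`, `euc_jp`, and `cp949` correspond to SJIS-Win, EUC-JP, and UHC, and the function name `encode_report` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return {charset: (binary_string, hex_string)} for the given text."""
    report = {}
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for each unencodable character.
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        report[name] = (bits, raw.hex())
    return report


if __name__ == "__main__":
    for name, (bits, hexa) in encode_report("堤^").items():
        print(name, bits, hexa)
```

Each charset maps the same character to a different byte sequence, so the binary and hexadecimal columns differ from row to row even though the input is identical.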

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).