To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 濡??濡??意??濡??濡?????濡??^ 10010100010001110011111100111111100101000100011100111111001111111000100011010011001111110011111110010100010001110011111100111111100101000100011100111111001111110011111100111111001111111001010001000111001111110011111101011110 94473f3f94473f3f88d33f3f94473f3f94473f3f3f3f3f94473f3f5e
EUC-JP 濡??濡??意??濡??濡?????濡??^ 11000111101010000011111100111111110001111010100000111111001111111011000011010101001111110011111111000111101010000011111100111111110001111010100000111111001111110011111100111111001111111100011110101000001111110011111101011110 c7a83f3fc7a83f3fb0d53f3fc7a83f3fc7a83f3f3f3f3fc7a83f3f5e
UTF-8 濡먮죿濡뉖젚意붾죱濡쀫죶濡쀫죬溜뽰뵒濡뚮죻^ 11100110101111111010000111101011101010001010111011101100101000111011111111100110101111111010000111101011100010011001011011101100101000001001101011100110100001001000111111101011101101101011111011101100101000111011000111100110101111111010000111101100100000001010101111101100101000111011011011100110101111111010000111101100100000001010101111101100101000111010110011101111101001111000101111101011101111011011000011101011101101011001001011100110101111111010000111101011100110101010111011101100101000111011101101011110 e6bfa1eba8aeeca3bfe6bfa1eb8996eca09ae6848febb6beeca3b1e6bfa1ec80abeca3b6e6bfa1ec80abeca3acefa78bebbdb0ebb592e6bfa1eb9aaeeca3bb5e
UHC 濡먮죿濡뉖젚意붾죱濡쀫죶濡쀫죬溜뽰뵒濡뚮죻^ 11101011101000011001000011101011101000011001011111101011101000011000011111101011101000001001011011101011111100101001010011101011101000011000110011101011101000011001011111101011101000011001000011101011101000011001011111101011101000011000011111101010111111101001011011101100100101001001010011101011101000011000110011101011101000011001010101011110 eba190eba197eba187eba096ebf294eba18ceba197eba190eba197eba187eafe96ec9494eba18ceba1955e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
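
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: it assumes that Python's codec names `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical, and individual mappings may differ from the page's converter.

```python
# Charset labels from the table, mapped to Python codec names (an assumption).
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("^"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.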