What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??孺??喩??壓?????濡〓?倭 1000101001111000001111110011111110011011011111010011111100111111100110100110011100111111001111111001101011011000001111110011111100111111001111110011111110010100010001111000000110101100001111111001100001100000 8a783f3f9b7d3f3f9a673f3f9ad83f3f3f3f3f944781ac3f9860
EUC-JP 岳??孺??喩??壓??嫄??濡〓?倭 10110011110110010011111100111111110101011101111000111111001111111101001111001000001111110011111111010100110110100011111100111111100011111011101010100001001111110011111111000111101010001010001010101110001111111100111111000001 b3d93f3fd5de3f3fd3c83f3fd4da3f3f8fbaa13f3fc7a8a2ae3fcfc1
UTF-8 岳묒빘孺욤짆喩믩쇀壓믪궇嫄띌뼸濡〓굻倭 111001011011001010110011111010111010110010010010111010111011100110011000111001011010110110111010111011001001101010100100111011001010011110000110111001011001011010101001111010111010111110101001111011001000011110000000111001011010001110010011111010111010111110101010111010101011011010000111111001011010101110000100111010111001110110001100111010111011110010111000111001101011111110100001111000111000000010010011111010101011010110111011111001011000000010101101 e5b2b3ebac92ebb998e5adbaec9aa4eca786e596a9ebafa9ec8780e5a393ebafaaeab687e5ab84eb9d8cebbcb8e6bfa1e38093eab5bbe580ad
UHC 岳묒빘孺욤짆喩믩쇀壓믪궇嫄띌뼸濡〓굻倭 1110010010111111100100011110110010010101101110011110101011101000101111111110100010100011100101011110101011100111100100101110101110011001101101001110010011100010100100101110110010000010101000001110101010110001101101101110100110010110101110111110101110100001101000011110101110110001101111111110100011011110 e4bf91ec95b9eae8bfe8a395eae792eb99b4e4e292ec82a0eab1b6e996bbeba1a1ebb1bfe8de

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
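
The same conversion can be sketched offline in Python. This is a minimal illustration, not the code behind this page: the mapping from the charset labels above to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper name encode_table are assumptions made for the example. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to be what produces the runs of 3f in the table above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_table(text):
    """Return (label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hexstr in encode_table("岳"):
    print(label, bits, hexstr)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.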