What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN タハチロタタ称」n}タハチロタタ称」n{^ 1100000011001010110000011101101111000000110000001000111111001100101000110110111001111101110000001100101011000001110110111100000011000000100011111100110010100011011011100111101101011110 c0cac1dbc0c08fcca36e7dc0cac1dbc0c08fcca36e7b5e
EUC-JP タハチロタタ称」n}タハチロタタ称」n{^ 10001110110000001000111011001010100011101100000110001110110110111000111011000000100011101100000010111110110011101000111010100011011011100111110110001110110000001000111011001010100011101100000110001110110110111000111011000000100011101100000010111110110011101000111010100011011011100111101101011110 8ec08eca8ec18edb8ec08ec0bece8ea36e7d8ec08eca8ec18edb8ec08ec0bece8ea36e7b5e
UTF-8 タハチロタタ称」n}タハチロタタ称」n{^ 1110111110111110100000001110111110111110100010101110111110111110100000011110111110111110100110111110111110111110100000001110111110111110100000001110011110100111101100001110111110111101101000110110111001111101111011111011111010000000111011111011111010001010111011111011111010000001111011111011111010011011111011111011111010000000111011111011111010000000111001111010011110110000111011111011110110100011011011100111101101011110 efbe80efbe8aefbe81efbe9befbe80efbe80e7a7b0efbda36e7defbe80efbe8aefbe81efbe9befbe80efbe80e7a7b0efbda36e7b5e
UHC ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e

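SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f), as in the ISO-8859-1 and UHC rows.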
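
The conversion in the table can be approximated offline. The following is a minimal sketch, not this page's actual implementation: the helper name encode_table is hypothetical, and it assumes that Python's codecs "cp932" and "cp949" correspond to SJIS-win and UHC. Edge-case mappings may differ from the converter used here.

```python
# Hypothetical helper: maps the page's charset labels to Python codec names.
# The cp932 ~ SJIS-win and cp949 ~ UHC pairings are assumptions.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("称n}").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.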