Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 頂???蒸?地獺n}頂???蒸?地獺n{^ 1001001010111000001111110011111100111111100011111111011000111111100100100110111011100000110110100110111001111101100100101011100000111111001111110011111110001111111101100011111110010010011011101110000011011010011011100111101101011110 92b83f3f3f8ff63f926ee0da6e7d92b83f3f3f8ff63f926ee0da6e7b5e
EUC-JP 頂???蒸?地獺n}頂???蒸?地獺n{^ 1100010010111010001111110011111100111111101111101111100000111111110000111100111111100000110111000110111001111101110001001011101000111111001111110011111110111110111110000011111111000011110011111110000011011100011011100111101101011110 c4ba3f3f3fbef83fc3cfe0dc6e7dc4ba3f3f3fbef83fc3cfe0dc6e7b5e
UTF-8 頂비렰렣蒸렧地獺n}頂비렰렣蒸렧地獺n{^ 1110100110100000100000101110101110111001100001001110101110100000101100001110101110100000101000111110100010010010101110001110101110100000101001111110010110011100101100001110011110001101101110100110111001111101111010011010000010000010111010111011100110000100111010111010000010110000111010111010000010100011111010001001001010111000111010111010000010100111111001011001110010110000111001111000110110111010011011100111101101011110 e9a082ebb984eba0b0eba0a3e892b8eba0a7e59cb0e78dba6e7de9a082ebb984eba0b0eba0a3e892b8eba0a7e59cb0e78dba6e7b5e
UHC 頂비렰렣蒸렧地獺n}頂비렰렣蒸렧地獺n{^ 11110000101000101011101011110001100011101011110110001110101101001111000111111010100011101011011011110010101000101101001110110111011011100111110111110000101000101011101011110001100011101011110110001110101101001111000111111010100011101011011011110010101000101101001110110111011011100111101101011110 f0a2baf18ebd8eb4f1fa8eb6f2a2d3b76e7df0a2baf18ebd8eb4f1fa8eb6f2a2d3b76e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
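
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool's in a few edge-case characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes visible in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary_string, hex_string)} for each charset.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    report = {}
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


# Example: print one row per charset for a short input.
for label, (bits, hexstr) in encode_report("n{^").items():
    print(label, bits, hexstr)
```

For pure ASCII input such as `n{^`, every charset listed here produces the same bytes (`6e7b5e`); the rows only diverge once the input contains non-ASCII characters.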