What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蒸基??甑???咀?財?蒸基??甑???咀?財?B 1000111111110110100010101110111000111111001111111000110110011001001111110011111100111111100110011111000000111111100011011110000000111111100011111111011010001010111011100011111100111111100011011001100100111111001111110011111110011001111100000011111110001101111000000011111101000010 8ff68aee3f3f8d993f3f3f99f03f8de03f8ff68aee3f3f8d993f3f3f99f03f8de03f42
EUC-JP 蒸基??甑???咀?財?蒸基??甑???咀?財?B 1011111011111000101101001111000000111111001111111011100111111001001111110011111100111111110100101111001000111111101110101110001000111111101111101111100010110100111100000011111100111111101110011111100100111111001111110011111111010010111100100011111110111010111000100011111101000010 bef8b4f03f3fb9f93f3f3fd2f23fbae23fbef8b4f03f3fb9f93f3f3fd2f23fbae23f42
UTF-8 蒸基렰렚甑비렰렑咀렭財죽蒸基렰렚甑비렰렑咀렭財죽B 11101000100100101011100011100101100111111011101011101011101000001011000011101011101000001001101011100111100101001001000111101011101110011000010011101011101000001011000011101011101000001001000111100101100100101000000011101011101000001010110111101000101100101010000111101100101000111011110111101000100100101011100011100101100111111011101011101011101000001011000011101011101000001001101011100111100101001001000111101011101110011000010011101011101000001011000011101011101000001001000111100101100100101000000011101011101000001010110111101000101100101010000111101100101000111011110101000010 e892b8e59fbaeba0b0eba09ae79491ebb984eba0b0eba091e59280eba0ade8b2a1eca3bde892b8e59fbaeba0b0eba09ae79491ebb984eba0b0eba091e59280eba0ade8b2a1eca3bd42
UHC 蒸基렰렚甑비렰렑咀렭財죽蒸基렰렚甑비렰렑咀렭財죽B 11110001111110101101000011110001100011101011110110001110101011011111000111110111101110101111000110001110101111011000111010100110111011101011101010001110101110101110111010101111110000011101011111110001111110101101000011110001100011101011110110001110101011011111000111110111101110101111000110001110101111011000111010100110111011101011101010001110101110101110111010101111110000011101011101000010 f1fad0f18ebd8eadf1f7baf18ebd8ea6eeba8ebaeeafc1d7f1fad0f18ebd8eadf1f7baf18ebd8ea6eeba8ebaeeafc1d742

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
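
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("蒸B"):
        print(label, binary, hexadecimal)
```

For the input "蒸B", the UTF-8 row of this sketch gives e892b842, and the ISO-8859-1 row gives 3f42 because 蒸 has no ISO-8859-1 representation.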