What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鏑??檍?增??檍?鏑??檍?增??檍?^ 1001001101001100001111110011111110011110111110000011111111111010100111010011111100111111100111101111100000111111100100110100110000111111001111111001111011111000001111111111101010011101001111110011111110011110111110000011111101011110 934c3f3f9ef83ffa9d3f3f9ef83f934c3f3f9ef83ffa9d3f3f9ef83f5e
EUC-JP 鏑??檍????檍?鏑??檍????檍?^ 110001011010110100111111001111111101110011111010001111110011111100111111001111111101110011111010001111111100010110101101001111110011111111011100111110100011111100111111001111110011111111011100111110100011111101011110 c5ad3f3fdcfa3f3f3f3fdcfa3fc5ad3f3fdcfa3f3f3f3fdcfa3f5e
UTF-8 鏑켕렩檍쇤增롈렩檍뻤鏑켕렩檍쇤增롈렩檍뻤^ 11101001100011111001000111101100101111001001010111101011101000001010100111100110101010101000110111101100100001111010010011100101101000101001111011101011101000011000100011101011101000001010100111100110101010101000110111101011101110111010010011101001100011111001000111101100101111001001010111101011101000001010100111100110101010101000110111101100100001111010010011100101101000101001111011101011101000011000100011101011101000001010100111100110101010101000110111101011101110111010010001011110 e98f91ecbc95eba0a9e6aa8dec87a4e5a29eeba188eba0a9e6aa8debbba4e98f91ecbc95eba0a9e6aa8dec87a4e5a29eeba188eba0a9e6aa8debbba45e
UHC 鏑켕렩檍쇤增롈렩檍뻤鏑켕렩檍쇤增롈렩檍뻤^ 1110111011101011110001001101000010001110101101111110010111100101101111001110100111110001111100101000111011001110100011101011011111100101111001011011101110111100111011101110101111000100110100001000111010110111111001011110010110111100111010011111000111110010100011101100111010001110101101111110010111100101101110111011110001011110 eeebc4d08eb7e5e5bce9f1f28ece8eb7e5e5bbbceeebc4d08eb7e5e5bce9f1f28ece8eb7e5e5bbbc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
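
As a rough illustration of how a table like the one above can be produced, the following Python sketch encodes a string in each charset and prints the bytes as binary and hexadecimal. The helper name encode_to_bits and the mapping to Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions made for this example, not the converter's actual implementation. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 as stated in the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's codec for Unified Hangul Code
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "鏑^"  # first and last characters of the sample row above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary column is always eight times the length of the byte sequence and four times the length of the hexadecimal column.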