To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 爾畔b辛?災???v爾畔b辛?災???vB 10001110101000101001010011001000100000101000001010010000011010000011111110001101110100000011111100111111001111110111011010001110101000101001010011001000100000101000001010010000011010000011111110001101110100000011111100111111001111110111011001000010 8ea294c8828290683f8dd03f3f3f768ea294c8828290683f8dd03f3f3f7642
EUC-JP 爾畔b辛?災???v爾畔b辛?災???vB 10111100101001001100100011001010101000111110001010111111110010010011111110111010110100100011111100111111001111110111011010111100101001001100100011001010101000111110001010111111110010010011111110111010110100100011111100111111001111110111011001000010 bca4c8caa3e2bfc93fbad23f3f3f76bca4c8caa3e2bfc93fbad23f3f3f7642
UTF-8 爾畔b辛렓災고렣쁩v爾畔b辛렓災고렣쁩vB 111001111000100010111110111001111001010110010100111011111011110110000010111010001011111010011011111010111010000010010011111001111000000110111101111010101011001110100000111010111010000010100011111011001000000110101001011101101110011110001000101111101110011110010101100101001110111110111101100000101110100010111110100110111110101110100000100100111110011110000001101111011110101010110011101000001110101110100000101000111110110010000001101010010111011001000010 e788bee79594efbd82e8be9beba093e781bdeab3a0eba0a3ec81a976e788bee79594efbd82e8be9beba093e781bdeab3a0eba0a3ec81a97642
UHC 爾畔b辛렓災고렣쁩v爾畔b辛렓災고렣쁩vB 111011001011001111011010111011011010001111100010111000111111010010001110101010001110111010101100101100001110110110001110101101001011101111011110011101101110110010110011110110101110110110100011111000101110001111110100100011101010100011101110101011001011000011101101100011101011010010111011110111100111011001000010 ecb3daeda3e2e3f48ea8eeacb0ed8eb4bbde76ecb3daeda3e2e3f48ea8eeacb0ed8eb4bbde7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)