To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 臾?М臾щぞ維??臾?М臾щぞ維??B 11100100011010110011111110000100010011011110010001101011100001001000101110000010101111001000100011011011001111110011111111100100011010110011111110000100010011011110010001101011100001001000101110000010101111001000100011011011001111110011111101000010 e46b3f844de46b848b82bc88db3f3fe46b3f844de46b848b82bc88db3f3f42
EUC-JP 臾?М臾щぞ維??臾?М臾щぞ維??B 11100111110011000011111110100111101011101110011111001100101001111110101110100100101111101011000011011101001111110011111111100111110011000011111110100111101011101110011111001100101001111110101110100100101111101011000011011101001111110011111101000010 e7cc3fa7aee7cca7eba4beb0dd3f3fe7cc3fa7aee7cca7eba4beb0dd3f3f42
UTF-8 臾뀀М臾щぞ維묐춯臾뀀М臾щぞ維묐춯B 111010001000011110111110111010111000000010000000110100001001110011101000100001111011111011010001100010011110001110000001100111101110011110110110101011011110101110101100100100001110110010110110101011111110100010000111101111101110101110000000100000001101000010011100111010001000011110111110110100011000100111100011100000011001111011100111101101101010110111101011101011001001000011101100101101101010111101000010 e887beeb8080d09ce887bed189e3819ee7b6adebac90ecb6afe887beeb8080d09ce887bed189e3819ee7b6adebac90ecb6af42
UHC 臾뀀М臾щぞ維묐춯臾뀀М臾щぞ維묐춯B 11101011101011001011001011101011101011001010111011101011101011001010110011101011101010101011111011101011101010111001000111101011101011011000110011101011101011001011001011101011101011001010111011101011101011001010110011101011101010101011111011101011101010111001000111101011101011011000110001000010 ebacb2ebacaeebacacebaabeebab91ebad8cebacb2ebacaeebacacebaabeebab91ebad8c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)