To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??E?????????E???????^ 001111110011111101000101001111110011111100111111001111110011111100111111001111110011111100111111010001010011111100111111001111110011111100111111001111110011111101011110 3f3f453f3f3f3f3f3f3f3f3f453f3f3f3f3f3f3f5e
SJIS-WIN 障?E除?彬除??魄障?E除?彬除??白^ 10001111111000010011111101000101100011111001110000111111100101010110101010001111100111000011111100111111111010011010111010001111111000010011111101000101100011111001110000111111100101010110101010001111100111000011111100111111100101001001001001011110 8fe13f458f9c3f956a8f9c3f3fe9ae8fe13f458f9c3f956a8f9c3f3f94925e
EUC-JP 障?E除?彬除??魄障?E除?彬除??白^ 10111110111000110011111101000101101111011111110000111111110010011100101110111101111111000011111100111111111100101011000010111110111000110011111101000101101111011111110000111111110010011100101110111101111111000011111100111111110001111111001001011110 bee33f45bdfc3fc9cbbdfc3f3ff2b0bee33f45bdfc3fc9cbbdfc3f3fc7f25e
UTF-8 障렚E除곈彬除곁렠魄障렚E除곈彬除곁렠白^ 111010011001101010011100111010111010000010011010010001011110100110011001101001001110101010110011100010001110010110111101101011001110100110011001101001001110101010110011100000011110101110100000101000001110100110101101100001001110100110011010100111001110101110100000100110100100010111101001100110011010010011101010101100111000100011100101101111011010110011101001100110011010010011101010101100111000000111101011101000001010000011100111100110011011110101011110 e99a9ceba09a45e999a4eab388e5bdace999a4eab381eba0a0e9ad84e99a9ceba09a45e999a4eab388e5bdace999a4eab381eba0a0e799bd5e
UHC 障렚E除곈彬除곁렠魄障렚E除곈彬除곁렠白^ 111011101010000110001110101011010100010111110000101101101011000011101001110111101010111111110000101101101011000011100111100011101011000111011011110111101110111010100001100011101010110101000101111100001011011010110000111010011101111010101111111100001011011010110000111001111000111010110001110110111101110001011110 eea18ead45f0b6b0e9deaff0b6b0e78eb1dbdeeea18ead45f0b6b0e9deaff0b6b0e78eb1dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)