To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??援ζ?幽??閻??異??癒??鈺 111001001000100000111111001111111000100110000111100000111100010000111111100101110100100000111111001111111110100010000101001111110011111110001000110110010011111100111111100101101111110000111111001111111111101111000100 e4883f3f898783c43f97483f3fe8853f3f88d93f3f96fc3f3ffbc4
EUC-JP 艾??援ζ?幽??閻??異??癒??鈺 11100111111010000011111100111111101100011110011110100110110001100011111111001101101010010011111100111111111011111110010100111111001111111011000011011011001111110011111111001100111111100011111100111111100011111110001111010101 e7e83f3fb1e7a6c63fcda93f3fefe53f3fb0db3f3fccfe3f3f8fe3d5
UTF-8 艾싳궇援ζ젔幽끹걶閻롮눘異루춯癒뀄맋鈺 1110100010001001101111101110110010001011101100111110101010110110100001111110011010001111101101001100111010110110111011001010000010010100111001011011100110111101111010111000000110111001111010101011000110110110111010011001011010111011111010111010000110101110111010111000100010011000111001111001010110110000111010111010001110101000111011001011011010101111111001111001100110010010111010111000000010000100111010111010011110001011111010011000100010111010 e889beec8bb3eab687e68fb4ceb6eca094e5b9bdeb81b9eab1b6e996bbeba1aeeb8898e795b0eba3a8ecb6afe79992eb8084eba78be988ba
UHC 艾싳궇援ζ젔幽끹걶閻롮눘異루춯癒뀄맋鈺 1110010011110101100110101110110010000010101000001110101010110101101001011110011010100000100100101110101011101011100001011110001110000001100111001110011110100010100011101110110010000111101100011110110010110110101101111110011110101101100011001110101110101000101100101110110110010000101000111110100010101101 e4f59aec82a0eab5a5e6a092eaeb85e3819ce7a28eec87b1ecb6b7e7ad8ceba8b2ed90a3e8ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)