To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????松??嚥〓????諛??筌 11100001100111110011111100111111001111110011111100111111100011111011110000111111001111111001101010001011100000011010110000111111001111110011111100111111111001101000011100111111001111111110001010100011 e19f3f3f3f3f3f8fbc3f3f9a8b81ac3f3f3f3fe6873f3fe2a3
EUC-JP 癲??靷??松??嚥〓?瑗??諛??筌 1110001010100001001111110011111110001111111001111011110100111111001111111011111010111110001111110011111111010011111010111010001010101110001111111000111111001100110000000011111100111111111010111110011100111111001111111110010010100101 e2a13f3f8fe7bd3f3fbebe3f3fd3eba2ae3f8fccc03f3febe73f3fe4a5
UTF-8 癲앷쑬靷뽪끽松썬렃嚥〓뀈瑗삣넇諛몃㎥筌 111001111001100110110010111011001001010110110111111011001001000110101100111010011001110110110111111010111011110110101010111010111000000110111101111001101001110110111110111011001000110110101100111010111010000010000011111001011001101010100101111000111000000010010011111010111000000010001000111001111001000110010111111011001000001010100011111010111000010010000111111010001010101110011011111010111010101010000011111000111000111010100101111001111010110110001100 e799b2ec95b7ec91ace99db7ebbdaaeb81bde69dbeec8daceba083e59aa5e38093eb8088e79197ec82a3eb8487e8ab9bebaa83e38ea5e7ad8c
UHC 癲앷쑬靷뽪끽松썬렃嚥〓뀈瑗삣넇諛몃㎥筌 1110111110100110100111011110101010111110101010001110110011100110100101101110011010110011101000111110000111100110101111011110001110001110100111011110011010111111101000011110101110000101100001001110101010111100101110111110010110000110100101111110101110110000101110001110101110100111101010011110111110100111 efa69deabea8ece696e6b3a3e1e6bde38e9de6bfa1eb8584eabcbbe58697ebb0b8eba7a9efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)