To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲〓?徇??怨??櫻?????諛??猥??誼 111000011001111110000001101011000011111110011100011011010011111100111111100010011000010100111111001111111001111101001110001111110011111100111111001111110011111111100110100001110011111100111111111000001100111000111111001111111000101101100010 e19f81ac3f9c6d3f3f89853f3f9f4e3f3f3f3f3fe6873f3fe0ce3f3f8b62
EUC-JP 癲〓?徇??怨??櫻??堉??諛??猥??誼 1110001010100001101000101010111000111111110101111100111000111111001111111011000111100101001111110011111111011101101011110011111100111111100011111011011111111101001111110011111111101011111001110011111100111111111000001101000000111111001111111011010111000011 e2a1a2ae3fd7ce3f3fb1e53f3fddaf3f3f8fb7fd3f3febe73f3fe0d03f3fb5c3
UTF-8 癲〓쵎徇됵쭓怨뺤젵櫻뗰퐣堉뉒뿆諛깃퐷猥됰씭誼 111001111001100110110010111000111000000010010011111011001011010110001110111001011011111010000111111010111001000010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010110101111001101010101110111011111010111001011110110000111011011001000010100011111001011010000010001001111010111000100110010010111010111011111110000110111010001010101110011011111010101011100110000011111011011001000010110111111001111000110010100101111010111001000010110000111011001001010010101101111010001010101010111100 e799b2e38093ecb58ee5be87eb90b5ecad93e680a8ebbaa4eca0b5e6abbbeb97b0ed90a3e5a089eb8992ebbf86e8ab9beab983ed90b7e78ca5eb90b0ec94ade8aabc
UHC 癲〓쵎徇됵쭓怨뺤젵櫻뗰퐣堉뉒뿆諛깃퐷猥됰씭誼 1110111110100110101000011110101110101100100100001110001011011111100010011110111110100111100010111110101010110011100101011110110010100000101010011110010110100001100010111110111110111101100011001110101110111100100001111110011110010111100011011110101110110000101100011110101010111101101000001110100011100101100010011110101110011101101111101110101111111110 efa6a1ebac90e2df89efa78beab395eca0a9e5a18befbd8cebbc87e7978debb0b1eabda0e8e589eb9dbeebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)