What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 禮썽ëþ¾è½ëô•겱療경ЬB 111011111010011010110110111011001000110110111101111010111111111010111110111010001011110110011101111010111111010010010101111010101011001010110001111011111010011110000001111010101011001010111101110100001010110001000010 efa6b6ec8dbdebfebee8bd9debf495eab2b1efa781eab2bdd0ac42
SJIS-WIN ??¶??????????????±?§?????¬B 00111111001111111000000111110111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110000001011111010011111110000001100110000011111100111111001111110011111100111111100000011100101001000010 3f3f81f73f3f3f3f3f3f3f3f3f3f3f3f3f3f817d3f81983f3f3f3f3f81ca42
EUC-JP 禮ì??ëþ?è??ëô?ê?±ï§?ê???¬B 1000111110101011110000011000111110100010110000111010001011111001100011111010101111000000001111110011111110001111101010111011001110001111101010011101000000111111100011111010101110110010001111110011111110001111101010111011001110001111101010111101010000111111100011111010101110110100001111111010000111011110100011111010101111000001101000011111100000111111100011111010101110110100001111110011111100111111101000101100110001000010 8fabc18fa2c3a2f98fabc03f3f8fabb38fa9d03f8fabb23f3f8fabb38fabd43f8fabb43fa1de8fabc1a1f83f8fabb43f3f3fa2cc42
UTF-8 禮썽ëþ¾è½ëô•겱療경ЬB 1100001110101111110000101010011011000010101101101100001110101100110000101000110111000010101111011100001110101011110000111011111011000010101111101100001110101000110000101011110111000010100111011100001110101011110000111011010011000010100101011100001110101010110000101011001011000010101100011100001110101111110000101010011111000010100000011100001110101010110000101011001011000010101111011100001110010000110000101010110001000010 c3afc2a6c2b6c3acc28dc2bdc3abc3bec2bec3a8c2bdc29dc3abc3b4c295c3aac2b2c2b1c3afc2a7c281c3aac2b2c2bdc390c2ac42
UHC ??¶??½?þ¾?½?????²±?§??²½Ð?B 0011111100111111101000101101001000111111001111111010100011110110001111111010100110101101101010001111101000111111101010001111011000111111001111110011111100111111001111111010100111110111101000011011111000111111101000011101011100111111001111111010100111110111101010001111011010101000101000100011111101000010 3f3fa2d23f3fa8f63fa9ada8fa3fa8f63f3f3f3f3fa9f7a1be3fa1d73f3fa9f7a8f6a8a23f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
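
The conversion shown in the table above can be approximated with a short Python sketch. This is not the converter's actual implementation: the function names (`encode_row`, `encode_table`) and the mapping from the table's charset labels to Python codec names are assumptions made for illustration, and Python's codecs may differ from the tool's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the tool does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# SJIS-WIN is treated as CP932 and UHC as CP949; both are approximations.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, bit string, hex string.
    for label, (bits, hexstr) in encode_table("B").items():
        print(label, bits, hexstr)
```

Because all five charsets are ASCII-compatible, the letter `B` encodes to the single byte `42` in each of them, which matches the trailing `42` in every row of the table.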