To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 äâ¶ýæºç¦±väâ¶ýæºç¦±vB 111001001110001010110110111111011110011010111010111001111010011010110001011101101110010011100010101101101111110111100110101110101110011110100110101100010111011001000010 e4e2b6fde6bae7a6b176e4e2b6fde6bae7a6b17642
SJIS-WIN ??¶?????±v??¶?????±vB 00111111001111111000000111110111001111110011111100111111001111110011111110000001011111010111011000111111001111111000000111110111001111110011111100111111001111110011111110000001011111010111011001000010 3f3f81f73f3f3f3f3f817d763f3f81f73f3f3f3f3f817d7642
EUC-JP äâ¶ýæºç¦±väâ¶ýæºç¦±vB 1000111110101011101000111000111110101011101001001010001011111001100011111010101111110010100011111010100111000001100011111010001011101011100011111010101110101110100011111010001011000011101000011101111001110110100011111010101110100011100011111010101110100100101000101111100110001111101010111111001010001111101010011100000110001111101000101110101110001111101010111010111010001111101000101100001110100001110111100111011001000010 8faba38faba4a2f98fabf28fa9c18fa2eb8fabae8fa2c3a1de768faba38faba4a2f98fabf28fa9c18fa2eb8fabae8fa2c3a1de7642
UTF-8 äâ¶ýæºç¦±väâ¶ýæºç¦±vB 110000111010010011000011101000101100001010110110110000111011110111000011101001101100001010111010110000111010011111000010101001101100001010110001011101101100001110100100110000111010001011000010101101101100001110111101110000111010011011000010101110101100001110100111110000101010011011000010101100010111011001000010 c3a4c3a2c2b6c3bdc3a6c2bac3a7c2a6c2b176c3a4c3a2c2b6c3bdc3a6c2bac3a7c2a6c2b17642
UHC ??¶?æº??±v??¶?æº??±vB 0011111100111111101000101101001000111111101010011010000110101000101011000011111100111111101000011011111001110110001111110011111110100010110100100011111110101001101000011010100010101100001111110011111110100001101111100111011001000010 3f3fa2d23fa9a1a8ac3f3fa1be763f3fa2d23fa9a1a8ac3f3fa1be7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)