What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 痢면뱄n}痢면뱄n{^ 1110111110100111101001011110101110101001101101001110101110110001100001000110111001111101111011111010011110100101111010111010100110110100111010111011000110000100011011100111101101011110 efa7a5eba9b4ebb1846e7defa7a5eba9b4ebb1846e7b5e
SJIS-WIN ?§¥??´?±?n}?§¥??´?±?n{^ 00111111100000011001100010000001100011110011111100111111100000010100110000111111100000010111110100111111011011100111110100111111100000011001100010000001100011110011111100111111100000010100110000111111100000010111110100111111011011100111101101011110 3f8198818f3f3f814c3f817d3f6e7d3f8198818f3f3f814c3f817d3f6e7b5e
EUC-JP ï§?ë©´ë±?n}ï§?ë©´ë±?n{^ 100011111010101111000001101000011111100000111111100011111010101110110011100011111010001011101101101000011010110110001111101010111011001110100001110111100011111101101110011111011000111110101011110000011010000111111000001111111000111110101011101100111000111110100010111011011010000110101101100011111010101110110011101000011101111000111111011011100111101101011110 8fabc1a1f83f8fabb38fa2eda1ad8fabb3a1de3f6e7d8fabc1a1f83f8fabb38fa2eda1ad8fabb3a1de3f6e7b5e
UTF-8 痢면뱄n}痢면뱄n{^ 1100001110101111110000101010011111000010101001011100001110101011110000101010100111000010101101001100001110101011110000101011000111000010100001000110111001111101110000111010111111000010101001111100001010100101110000111010101111000010101010011100001010110100110000111010101111000010101100011100001010000100011011100111101101011110 c3afc2a7c2a5c3abc2a9c2b4c3abc2b1c2846e7dc3afc2a7c2a5c3abc2a9c2b4c3abc2b1c2846e7b5e
UHC ?§???´?±?n}?§???´?±?n{^ 0011111110100001110101110011111100111111001111111010001010100101001111111010000110111110001111110110111001111101001111111010000111010111001111110011111100111111101000101010010100111111101000011011111000111111011011100111101101011110 3fa1d73f3f3fa2a53fa1be3f6e7d3fa1d73f3f3fa2a53fa1be3f6e7b5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
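
A conversion like the one in the table above can be sketched in Python as follows. This is only an illustration, not the page's actual implementation: the helper name encode_row and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions, and characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows. Results for non-ASCII input may differ from the page, since how the tool interprets its input is not known.

```python
from typing import Tuple

# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> Tuple[str, str]:
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a short ASCII sample.
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_row("n{^", codec)
    print(label, binary, hexadecimal)
```

For the ASCII sample "n{^", every charset listed yields the same bytes, 6e7b5e, which matches the tail of each row in the table. The rows only diverge for characters outside ASCII.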