To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 髢シ繧域。晁ャ梧ッウ髢シ繧域。晁ャ梧ッャ^ 11101001100101101011110011100011100000101000100011100110101000011001110111101000101011001000110011100110101011111011001111101001100101101011110011100011100000101000100011100110101000011001110111101000101011001000110011100110101011111010110001011110 e996bce38288e6a19de8ac8ce6afb3e996bce38288e6a19de8ac8ce6afac5e
EUC-JP 髢シ繧域。晁ャ梧ッウ髢シ繧域。晁ャ梧ッャ^ 1111000111110110100011101011110011100101111000101011000011101000100011101010000111011010111010101000111010101100101110001110100010001110101011111000111010110011111100011111011010001110101111001110010111100010101100001110100010001110101000011101101011101010100011101010110010111000111010001000111010101111100011101010110001011110 f1f68ebce5e2b0e88ea1daea8eacb8e88eaf8eb3f1f68ebce5e2b0e88ea1daea8eacb8e88eaf8eac5e
UTF-8 髢シ繧域。晁ャ梧ッウ髢シ繧域。晁ャ梧ッャ^ 11101001101010111010001011101111101111011011110011100111101110011010011111100101100111111001111111101111101111011010000111100110100110011000000111101111101111011010110011100110101000101010011111101111101111011010111111101111101111011011001111101001101010111010001011101111101111011011110011100111101110011010011111100101100111111001111111101111101111011010000111100110100110011000000111101111101111011010110011100110101000101010011111101111101111011010111111101111101111011010110001011110 e9aba2efbdbce7b9a7e59f9fefbda1e69981efbdace6a2a7efbdafefbdb3e9aba2efbdbce7b9a7e59f9fefbda1e69981efbdace6a2a7efbdafefbdac5e
UHC ???域?晁?梧?????域?晁?梧??^ 001111110011111100111111111001101011010000111111111100001100010100111111111001111111110000111111001111110011111100111111001111111110011010110100001111111111000011000101001111111110011111111100001111110011111101011110 3f3f3fe6b43ff0c53fe7fc3f3f3f3f3fe6b43ff0c53fe7fc3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)