What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲≪?踰e?蟻??魚 11100001100111111000000111100001001111111110011011111010100000101000010100111111100010110110000100111111001111111000101110011011 e19f81e13fe6fa82853f8b613f3f8b9b
EUC-JP 癲≪?踰e?蟻??魚 11100010101000011010001011100011001111111110110011111100101000111110010100111111101101011100001000111111001111111011010111111011 e2a1a2e33fecfca3e53fb5c23f3fb5fb
UTF-8 癲≪쉮踰e뿗蟻뤿말魚 111001111001100110110010111000101000100110101010111011001000100110101110111010001011100010110000111011111011110110000101111010111011111110010111111010001001111110111011111010111010010010111111111010111010011110010000111010011010110110011010 e799b2e289aaec89aee8b8b0efbd85ebbf97e89fbbeba4bfeba790e9ad9a
UHC 癲≪쉮踰e뿗蟻뤿말魚 1110111110100110101000011110110010011010100001101110101110110010101000111110010110010111100110101110101111111100100011111110101110111000101110111110010111100000 efa6a1ec9a86ebb2a3e5979aebfc8febb8bbe5e0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
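
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name `encode_report` is hypothetical, and the codec names are Python's own: `cp932` is assumed to correspond to SJIS-WIN and `cp949` to UHC. Replacing unencodable characters with `?` (0x3f) is also an assumption, chosen because it is consistent with the `3f` bytes in the table.

```python
# Python codec names assumed to match the charsets listed in the table.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) for text in each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_report("魚"):
        print(name, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.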