To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 è±ºï½¶è¬ 1110100010110001101110101110111110111101101101101110100010101100 e8b1baefbdb6e8ac
SJIS-WIN ?±???¶?¬ 0011111110000001011111010011111100111111001111111000000111110111001111111000000111001010 3f817d3f3f3f81f73f81ca
EUC-JP 豺ï?¶è¬ 10001111101010111011001010100001110111101000111110100010111010111000111110101011110000010011111110100010111110011000111110101011101100101010001011001100 8fabb2a1de8fa2eb8fabc13fa2f98fabb2a2cc
UTF-8 è±ºï½¶è¬ 11000011101010001100001010110001110000101011101011000011101011111100001010111101110000101011011011000011101010001100001010101100 c3a8c2b1c2bac3afc2bdc2b6c3a8c2ac
UHC ?±º?½¶?? 001111111010000110111110101010001010110000111111101010001111011010100010110100100011111100111111 3fa1bea8ac3fa8f6a2d23f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)