To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 単端尊辿卒息^ 10010010010100001001001001011011100100011011100010010010010010001001000110110010100100011010011101011110 9250925b91b8924891b291a75e
EUC-JP 単端尊辿卒息^ 11000011101100011100001110111100110000101011101011000011101010011100001010110100110000101010100101011110 c3b1c3bcc2bac3a9c2b4c2a95e
UTF-8 単端尊辿卒息^ 11100101100011011001100011100111101010111010111111100101101100001000101011101000101111101011111111100101100011011001001011100110100000011010111101011110 e58d98e7abafe5b08ae8bebfe58d92e681af5e
UHC ?端尊?卒息^ 0011111111010011101011101111000011101110001111111111000011101111111000111101001101011110 3fd3aef0ee3ff0efe3d35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)