What bit string does a character encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????B | 00111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f42 |
| SJIS-WIN | ???揖??癒??B | 001111110011111100111111100101110100101100111111001111111001011011111100001111110011111101000010 | 3f3f3f974b3f3f96fc3f3f42 |
| EUC-JP | ???揖?ł癒??B | 0011111100111111001111111100110110101100001111111000111110101001110010001100110011111110001111110011111101000010 | 3f3f3fcdac3f8fa9c8ccfe3f3f42 |
| UTF-8 | 令⑸㉡揖썹ł癒뀁돺B | 111011111010011010101000111000101001000110111000111000111000100110100001111001101000111110010110111011001000110110111001110001011000001011100111100110011001001011101011100000001000000111101011100011111011101001000010 | efa6a8e291b8e389a1e68f96ec8db9c582e79992eb8081eb8fba42 |
| UHC | 令⑸㉡揖썹ł癒뀁돺B | 11100111101010011010100111101011101010001011001011101011111001111011110111100111101010011010100111101011101010001011001011101100100010011011110101000010 | e7a9a9eba8b2ebe7bde7a9a9eba8b2ec89bd42 |

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
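
A conversion like the one this page performs can be sketched in Python. This is only an illustration, not the page's actual implementation: the helper `to_bit_strings` and the mapping from charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions. Characters a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the results above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("B", codec)
        print(label, binary, hexadecimal)
```

For the ASCII letter `B`, every charset listed here yields the single byte `42` (binary `01000010`), which is why each result row above ends in `42`.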