To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 兀??宜?兀??宜?B 100110010101100100111111001111111000101101011000001111111001100101011001001111110011111110001011010110000011111101000010 99593f3f8b583f99593f3f8b583f42
EUC-JP 兀??宜?兀??宜?B 110100011011101000111111001111111011010110111001001111111101000110111010001111110011111110110101101110010011111101000010 d1ba3f3fb5b93fd1ba3f3fb5b93f42
UTF-8 兀덄탢宜괆兀덄탢宜괆B 11100101100001011000000011101011100011011000010011101101100000111010001011100101101011101001110011101010101101001000011011100101100001011000000011101011100011011000010011101101100000111010001011100101101011101001110011101010101101001000011001000010 e58580eb8d84ed83a2e5ae9ceab486e58580eb8d84ed83a2e5ae9ceab48642
UHC 兀덄탢宜괆兀덄탢宜괆B 111010001011010010001000111001111011010110000101111010111111000110110000111111101110100010110100100010001110011110110101100001011110101111110001101100001111111001000010 e8b488e7b585ebf1b0fee8b488e7b585ebf1b0fe42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)