What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 鹽??輿??葯? | 1110101001100100001111110011111110010111011000000011111100111111111001001101111000111111 | ea643f3f97603f3fe4de3f
EUC-JP | 鹽??輿??葯? | 1111001111000101001111110011111111001101110000010011111100111111111010001110000000111111 | f3c53f3fcdc13f3fe8e03f
UTF-8 | 鹽땴룲輿걦껿葯긠 | 111010011011100110111101111010111001010110110100111010111010001110110010111010001011110010111111111010101011000110100110111010101011101110111111111010001001000110101111111010101011100010100000 | e9b9bdeb95b4eba3b2e8bcbfeab1a6eabbbfe891afeab8a0
UHC | 鹽땴룲輿걦껿葯긠 | 11100111101001001000101110001010100011111010011111100110101010111000000110001111100001000101010011100101101101011000001101100100 | e7a48b8a8fa7e6ab818f8454e5b58364

SJIS-WIN, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP), respectively.
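
For readers who want to reproduce this kind of table offline, the following is a minimal Python sketch, not the code behind this page. The mapping from the charset names above to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the rows above.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("鹽").items():
        print(name, bits, hex_string)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.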