What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 移??勘?基??諸 10001000110110100011111100111111100010101010100000111111100010101110111000111111001111111000111110010100 88da3f3f8aa83f8aee3f3f8f94
EUC-JP 移??勘?基??諸 10110000110111000011111100111111101101001010101000111111101101001111000000111111001111111011110111110100 b0dc3f3fb4aa3fb4f03f3fbdf4
UTF-8 移쇨렠勘렚基렰렖諸 111001111010011110111011111011001000011110101000111010111010000010100000111001011000101110011000111010111010000010011010111001011001111110111010111010111010000010110000111010111010000010010110111010001010101110111000 e7a7bbec87a8eba0a0e58b98eba09ae59fbaeba0b0eba096e8abb8
UHC 移쇨렠勘렚基렰렖諸 111011001011100110111100111010101000111010110001110010101110101110001110101011011101000011110001100011101011110110001110101010111111000010110011 ecb9bcea8eb1caeb8eadd0f18ebd8eabf0b3
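
The table above can be approximated with a short Python sketch. It is not the tool's actual implementation. The codec names `latin-1`, `cp932`, `euc_jp`, and `cp949` are assumed to be Python's closest equivalents of ISO-8859-1, SJIS-WIN, EUC-JP, and UHC, and the helper name `encode_report` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the substitutions visible in the table.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "移쇨렠勘렚基렰렖諸"  # the input used in the table above
    for label, codec in CHARSETS.items():
        bits, hexa = encode_report(sample, codec)
        print(label, bits, hexa)
```

Because each byte is formatted as exactly eight bits, the binary column is always four times as long as the hexadecimal column.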

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).