To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 砥?莊?砥?莊?B 10010011011101010011111111100100101101010011111110010011011101010011111111100100101101010011111101000010 93753fe4b53f93753fe4b53f42
EUC-JP 砥?莊?砥?莊?B 11000101110101100011111111101000101101110011111111000101110101100011111111101000101101110011111101000010 c5d63fe8b73fc5d63fe8b73f42
UTF-8 砥렱莊렱砥렱莊렱B 11100111101000001010010111101011101000001011000111101000100011101000101011101011101000001011000111100111101000001010010111101011101000001011000111101000100011101000101011101011101000001011000101000010 e7a0a5eba0b1e88e8aeba0b1e7a0a5eba0b1e88e8aeba0b142
UHC 砥렱莊렱砥렱莊렱B 1111001010110010100011101011111011101101111101101000111010111110111100101011001010001110101111101110110111110110100011101011111001000010 f2b28ebeedf68ebef2b28ebeedf68ebe42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)