What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e
SJIS-WIN | 貞???貞???^ | 1001001011100101001111110011111100111111100100101110010100111111001111110011111101011110 | 92e53f3f3f92e53f3f3f5e
EUC-JP | 貞???貞???^ | 1100010011100111001111110011111100111111110001001110011100111111001111110011111101011110 | c4e73f3f3fc4e73f3f3f5e
UTF-8 | 貞얩렠넸貞얩렠넵^ | 11101000101100101001111011101100100101101010100111101011101000001010000011101011100001001011100011101000101100101001111011101100100101101010100111101011101000001010000011101011100001001011010101011110 | e8b29eec96a9eba0a0eb84b8e8b29eec96a9eba0a0eb84b55e
UHC | 貞얩렠넸貞얩렠넵^ | 1110111111110110101111101110110110001110101100011011001111011110111011111111011010111110111011011000111010110001101100111101110001011110 | eff6beed8eb1b3deeff6beed8eb1b3dc5e
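
A conversion of this kind can be approximated in a few lines of Python. This is only a sketch, not the page's own implementation, which is not shown here. It assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?`, as the rows above suggest. The names `encode_row` and `CHARSETS` are hypothetical.

```python
def encode_row(text: str, charset: str) -> tuple[str, str, str]:
    """Return (text as the charset can represent it, binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(charset, errors="replace")
    shown = raw.decode(charset)
    # Render every byte as 8 binary digits, with no separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


# Assumed mapping from the labels used on the page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("貞얩렠넸貞얩렠넵^", codec)
        print(label, shown, bits, hexed)
```

Codec tables can differ between implementations in edge cases, so output from this sketch should be compared against the table rather than assumed identical.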

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).