What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖ζ?儒??葯 0011111100111111001111111001011101001011100000111100010000111111100011101111001000111111001111111110010011011110 3f3f3f974b83c43f8ef23f3fe4de
EUC-JP 艅??揖ζˇ儒??葯 100011111101011011111101001111110011111111001101101011001010011011000110100011111010001010110000101111001111010000111111001111111110100011100000 8fd6fd3f3fcdaca6c68fa2b0bcf43f3fe8e0
UTF-8 艅덈엪揖ζˇ儒묒쐺葯 11101000100010011000010111101011100011011000100011101100100101111010101011100110100011111001011011001110101101101100101110000111111001011000010010010010111010111010110010010010111011001001000010111010111010001001000110101111 e88985eb8d88ec97aae68f96ceb6cb87e58492ebac92ec90bae891af
UHC 艅덈엪揖ζˇ儒묒쐺葯 1110011010101001100010001110101110011110100000111110101111100111101001011110011010100010101001111110101011100011100100011110110010011100100111001110010110110101 e6a988eb9e83ebe7a5e6a2a7eae391ec9c9ce5b5

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
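
The conversion shown in the table can be approximated with a minimal Python sketch. It assumes Python's built-in codecs stand in for the charsets listed above; the mapping from table names to codec names (for example SJIS-WIN to cp932, UHC to cp949) and the helper name encode_row are illustrative assumptions, not the tool's actual implementation. Unencodable characters are replaced with "?" (3f), which is consistent with the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset names to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "葯"  # last character of the sample input in the table
    for name, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(name, bits, hexa)
```

For the single character 葯 this gives e891af in UTF-8, e4de in CP932 and e8e0 in EUC-JP, matching the trailing bytes of the corresponding rows above.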