What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 中??舵∵夭??牟 1001001010000110001111110011111110010001110001111000000111100110100110101110111000111111001111111001011010110100 92863f3f91c781e69aee3f3f96b4
EUC-JP 中??舵∵夭??牟 1100001111100110001111110011111111000010110010011010001011101000110101001111000000111111001111111100110010110110 c3e63f3fc2c9a2e8d4f03f3fccb6
UTF-8 中렑뤈舵∵夭쫸몹牟 111001001011100010101101111010111010000010010001111010111010010010001000111010001000100010110101111000101000100010110101111001011010010010101101111011001010101110111000111010111010101010111001111001111000100110011111 e4b8adeba091eba488e888b5e288b5e5a4adecabb8ebaab9e7899f
UHC 中렑뤈舵∵夭쫸몹牟 111100011110100110001110101001101000111110111000111101101110110010100001111100011110100011101100101001101000111110111000111101111101100110111111 f1e98ea68fb8f6eca1f1e8eca68fb8f7d9bf
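
The rows above can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here; the names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the codec names are assumptions about the closest Python equivalents (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the table, though edge-case mappings may differ from the tool's.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("中").items():
        print(label, binary, hexa)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.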

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).