To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 邵貞恫霑ク邵剃 11100111101110001001001011100101100111001001100011101000101111111011100011100111101110001001001011100100 e7b892e59c98e8bfb8e7b892e4
EUC-JP 邵貞恫霑ク邵剃 1110111010111010110001001110011111010111111110001111000011000001100011101011100011101110101110101100010011100110 eebac4e7d7f8f0c18eb8eebac4e6
UTF-8 邵貞恫霑ク邵剃 111010011000001010110101111010001011001010011110111001101000000110101011111010011001110010010001111011111011110110111000111010011000001010110101111001011000100110000011 e982b5e8b29ee681abe99c91efbdb8e982b5e58983
UHC 邵貞?霑?邵剃 111000011101000011101111111101100011111111101111110001010011111111100001110100001111010011101111 e1d0eff63fefc53fe1d0f4ef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)