To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 潁??肄??汲源 100111111111000100111111001111111110001111100101001111110011111110001011100000101000110010111001 9ff13f3fe3e53f3f8b828cb9
EUC-JP 潁??肄??汲源 110111101111001100111111001111111110011011100111001111110011111110110101111000101011100010111011 def33f3fe6e73f3fb5e2b8bb
UTF-8 潁뺣굙肄믥퓖汲源 111001101011110110000001111010111011101010100011111010101011010110011001111010001000001010000100111010111010111110100101111011011001001110010110111001101011000110110010111001101011101010010000 e6bd81ebbaa3eab599e88284ebafa5ed9396e6b1b2e6ba90
UHC 潁뺣굙肄믥퓖汲源 11100111101110001001010111101011100000101000000111101100101111011001001011100111101111111000000111010000111000111110101010111001 e7b895eb8281ecbd92e7bf81d0e3eab9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)