To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 譚馴擠鬩礼濫雋械 11100110100111011001001111101001100111011010100111101001101010011001011111100111100101111001010011101000101100101000101001000010 e69d93e99da9e9a997e79794e8b28a42
EUC-JP 譚馴擠鬩礼濫雋械 11101011111111011100011011101011110110101010101111110010101010111100111011101001110011011111010011110000101101001011001110100011 ebfdc6ebdaabf2abcee9cdf4f0b4b3a3
UTF-8 譚馴擠鬩礼濫雋械 111010001010110110011010111010011010011010110100111001101001001110100000111010011010110010101001111001111010010010111100111001101011111110101011111010011001101110001011111001101010001010110000 e8ad9ae9a6b4e693a0e9aca9e7a4bce6bfabe99b8be6a2b0
UHC 譚馴???濫雋械 11010011110010011110001011111000001111110011111100111111110101011011110111110001111001101100110011111110 d3c9e2f83f3f3fd5bdf1e6ccfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)