To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鷹?證?錚敎? 1001000111101001001111111110011010011010001111111110100001000010111110101100110100111111 91e93fe69a3fe842facd3f
EUC-JP 鷹?證?錚?? 11000010111010110011111111101011111110100011111111101111101000110011111100111111 c2eb3febfa3fefa33f3f
UTF-8 鷹렓證렖錚敎쓱 111010011011011110111001111010111010000010010011111010001010110110001001111010111010000010010110111010011000110010011010111001101001010110001110111011001001001110110001 e9b7b9eba093e8ad89eba096e98c9ae6958eec93b1
UHC 鷹렓證렖錚敎쓱 1110101111101101100011101010100011110001111110111000111010101011111011101011011011001110111001111011111010110011 ebed8ea8f1fb8eabeeb6cee7beb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)