To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??揖??瑜??冶 1110100101100110001111110011111110010111010010110011111100111111111000001110111100111111001111111001011011101000 e9663f3f974b3f3fe0ef3f3f96e8
EUC-JP 馭??揖??瑜??冶 1111000111000111001111110011111111001101101011000011111100111111111000001111000100111111001111111100110011101010 f1c73f3fcdac3f3fe0f13f3fccea
UTF-8 馭곥룊揖뀐㏊瑜곸컝冶 111010011010011010101101111010101011001110100101111010111010001110001010111001101000111110010110111010111000000010010000111000111000111110001010111001111001000110011100111010101011001110111000111011001011101110011101111001011000011010110110 e9a6adeab3a5eba38ae68f96eb8090e38f8ae7919ceab3b8ecbb9de586b6
UHC 馭곥룊揖뀐㏊瑜곸컝冶 1110010111011111100000011110001110001111100010011110101111100111101100101110111110100111101101011110101110100101100000011110110010110000100010001110010110100111 e5df81e38f89ebe7b2efa7b5eba581ecb088e5a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)