To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弔?畯脈?淨?垣?虞?弔?畯脈?淨?垣?虞?B 1001001010100010001111111111101101101111100101101010110000111111100111111100010000111111100010100101111100111111100010111111000100111111100100101010001000111111111110110110111110010110101011000011111110011111110001000011111110001010010111110011111110001011111100010011111101000010 92a23ffb6f96ac3f9fc43f8a5f3f8bf13f92a23ffb6f96ac3f9fc43f8a5f3f8bf13f42
EUC-JP 弔?畯脈?淨?垣?虞?弔?畯脈?淨?垣?虞?B 11000100101001000011111110001111110011011011101111001100101011100011111111011110110001100011111110110011110000000011111110110110111100110011111111000100101001000011111110001111110011011011101111001100101011100011111111011110110001100011111110110011110000000011111110110110111100110011111101000010 c4a43f8fcdbbccae3fdec63fb3c03fb6f33fc4a43f8fcdbbccae3fdec63fb3c03fb6f33f42
UTF-8 弔렟畯脈歷淨렠垣렖虞렧弔렟畯脈歷淨렠垣렖虞렧B 11100101101111001001010011101011101000001001111111100111100101011010111111101000100001001000100011101111101001101000110011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101000100110011001111011101011101000001010011111100101101111001001010011101011101000001001111111100111100101011010111111101000100001001000100011101111101001101000110011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101000100110011001111011101011101000001010011101000010 e5bc94eba09fe795afe88488efa68ce6b7a8eba0a0e59ea3eba096e8999eeba0a7e5bc94eba09fe795afe88488efa68ce6b7a8eba0a0e59ea3eba096e8999eeba0a742
UHC 弔렟畯脈歷淨렠垣렖虞렧弔렟畯脈歷淨렠垣렖虞렧B 111100001100000010001110101100001111000111100001110110001110011011100110101110001110111111100100100011101011000111101010101011111000111010101011111010011110010110001110101101101111000011000000100011101011000011110001111000011101100011100110111001101011100011101111111001001000111010110001111010101010111110001110101010111110100111100101100011101011011001000010 f0c08eb0f1e1d8e6e6b8efe48eb1eaaf8eabe9e58eb6f0c08eb0f1e1d8e6e6b8efe48eb1eaaf8eabe9e58eb642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)