To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????­??????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010110100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3fad3f3f3f3f3f3f5e
SJIS-WIN 項??航??嵬??嗚??言??辱ラ?壯??^ 100011011000000000111111001111111000110101110001001111110011111110011011110010100011111100111111100110100110101000111111001111111000110010111110001111110011111110010000010010101000001110001001001111111001101011100001001111110011111101011110 8d803f3f8d713f3f9bca3f3f9a6a3f3f8cbe3f3f904a83893f9ae13f3f5e
EUC-JP 項??航??嵬??嗚??言??辱ラ?壯??^ 101110011110000000111111001111111011100111010010001111110011111111010110110011000011111100111111110100111100101100111111001111111011100011000000001111110011111110111111101010111010010111101001001111111101010011100011001111110011111101011110 b9e03f3fb9d23f3fd6cc3f3fd3cb3f3fb8c03f3fbfaba5e93fd4e33f3f5e
UTF-8 項€뜥航⑶눈嵬뚲굺嗚뷁닣言⅛­辱ラ뀒壯숃뮅^ 111010011010000010000101111000101000001010101100111010111001110010100101111010001000100010101010111000101001000110110110111010111000100010001000111001011011010110101100111010111001101010110010111010101011010110111010111001011001011110011010111010111011011110000001111010111000101110100011111010001010100010000000111000101000010110011011110000101010110111101000101111101011000111100011100000111010100111101011100000001001001011100101101000111010111111101100100010001000001111101011101011101000010101011110 e9a085e282aceb9ca5e888aae291b6eb8888e5b5aceb9ab2eab5bae5979aebb781eb8ba3e8a880e2859bc2ade8beb1e383a9eb8092e5a3afec8883ebae855e
UHC 項€뜥航⑶눈嵬뚲굺嗚뷁닣言⅛­辱ラ뀒壯숃뮅^ 11111010101000111010001011100110100011011010100011111001111111101010100111101001101101001010101111101000111000111000110011101110100000101001100111100111111100001001010011101110100010001010001011100101111010111010100011111011101000011010100111101001101101001010101111101001100001011000110011101101111000001001100111101000100100101001010001011110 faa3a2e68da8f9fea9e9b4abe8e38cee8299e7f094ee88a2e5eba8fba1a9e9b4abe9858cede099e892945e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)