To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陟搾スケ驕倅ソカ雖迎陟搾スケ驕倅ソカ雖鶏^ 111010001010000010001101111011111011110110111001111010011000000110011000111001001011111110110110111001011010101110001100011111011110100010100000100011011110111110111101101110011110100110000001100110001110010010111111101101101110010110101011100011000111101101011110 e8a08defbdb9e98198e4bfb6e5ab8c7de8a08defbdb9e98198e4bfb6e5ab8c7b5e
EUC-JP 陟搾スケ驕倅ソカ雖迎陟搾スケ驕倅ソカ雖鶏^ 1111000010100010101110101111000110001110101111011000111010111001111100011110000111010000111001101000111010111111100011101011011011101010101011011011011111011110111100001010001010111010111100011000111010111101100011101011100111110001111000011101000011100110100011101011111110001110101101101110101010101101101101111101110001011110 f0a2baf18ebd8eb9f1e1d0e68ebf8eb6eaadb7def0a2baf18ebd8eb9f1e1d0e68ebf8eb6eaadb7dc5e
UTF-8 陟搾スケ驕倅ソカ雖迎陟搾スケ驕倅ソカ雖鶏^ 11101001100110011001111111100110100100001011111011101111101111011011110111101111101111011011100111101001101010011001010111100101100000001000010111101111101111011011111111101111101111011011011011101001100110111001011011101000101111111000111011101001100110011001111111100110100100001011111011101111101111011011110111101111101111011011100111101001101010011001010111100101100000001000010111101111101111011011111111101111101111011011011011101001100110111001011011101001101101101000111101011110 e9999fe690beefbdbdefbdb9e9a995e58085efbdbfefbdb6e99b96e8bf8ee9999fe690beefbdbdefbdb9e9a995e58085efbdbfefbdb6e99b96e9b68f5e
UHC 陟搾??驕???雖迎陟搾??驕???雖?^ 111101001011001111110011101101100011111100111111110011101111011000111111001111110011111111100010110011001110011111001010111101001011001111110011101101100011111100111111110011101111011000111111001111110011111111100010110011000011111101011110 f4b3f3b63f3fcef63f3f3fe2cce7caf4b3f3b63f3fcef63f3f3fe2cc3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)