To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????®??????????^ 00111111001111110011111100111111101011100011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3fae3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 絶??若???????????^ 100100001110001000111111001111111000111011100001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 90e23f3f8ee13f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP 絶??若®??????????^ 1100000011100100001111110011111110111100111000111000111110100010111011100011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 c0e43f3fbce38fa2ee3f3f3f3f3f3f3f3f3f3f5e
UTF-8 絶귝퇍若®셽寧좑푴曆욃뇨捻귡죳^ 111001111011010110110110111010101011011110011101111011011000011110001101111010001000101110100101110000101010111011101100100001011011110111101111101001101010101011101100101000101001000111101101100100011011010011101111101001101000101111101100100110101000001111101011100001111010100011101111101001101010010011101010101101111010000111101100101000111011001101011110 e7b5b6eab79ded878de88ba5c2aeec85bdefa6aaeca291ed91b4efa68bec9a83eb87a8efa6a4eab7a1eca3b35e
UHC 絶귝퇍若®셽寧좑푴曆욃뇨捻귡죳^ 11101111101111101000001011100110101101111001111011100101101101001010001011100111100110011000001011100111101011001010000011101111101111101000001011100110101101111001111011100101101101001010001011100110111101111000001011101001101000011000111001011110 efbe82e6b79ee5b4a2e79982e7aca0efbe82e6b79ee5b4a2e6f782e9a18e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)