To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??F^h??F^fN}??F^h??F^fN{^ 00111111001111110100011001011110011010000011111100111111010001100101111001100110010011100111110100111111001111110100011001011110011010000011111100111111010001100101111001100110010011100111101101011110 3f3f465e683f3f465e664e7d3f3f465e683f3f465e664e7b5e
SJIS-WIN 騷淞F^h騷淞F^fN}騷淞F^h騷淞F^fN{^ 111010010111101010011111110000100100011001011110011010001110100101111010100111111100001001000110010111100110011001001110011111011110100101111010100111111100001001000110010111100110100011101001011110101001111111000010010001100101111001100110010011100111101101011110 e97a9fc2465e68e97a9fc2465e664e7de97a9fc2465e68e97a9fc2465e664e7b5e
EUC-JP 騷淞F^h騷淞F^fN}騷淞F^h騷淞F^fN{^ 111100011101101111011110110001000100011001011110011010001111000111011011110111101100010001000110010111100110011001001110011111011111000111011011110111101100010001000110010111100110100011110001110110111101111011000100010001100101111001100110010011100111101101011110 f1dbdec4465e68f1dbdec4465e664e7df1dbdec4465e68f1dbdec4465e664e7b5e
UTF-8 騷淞F^h騷淞F^fN}騷淞F^h騷淞F^fN{^ 1110100110101000101101111110011010110111100111100100011001011110011010001110100110101000101101111110011010110111100111100100011001011110011001100100111001111101111010011010100010110111111001101011011110011110010001100101111001101000111010011010100010110111111001101011011110011110010001100101111001100110010011100111101101011110 e9a8b7e6b79e465e68e9a8b7e6b79e465e664e7de9a8b7e6b79e465e68e9a8b7e6b79e465e664e7b5e
UHC 騷淞F^h騷淞F^fN}騷淞F^h騷淞F^fN{^ 111000011101001111100001111001110100011001011110011010001110000111010011111000011110011101000110010111100110011001001110011111011110000111010011111000011110011101000110010111100110100011100001110100111110000111100111010001100101111001100110010011100111101101011110 e1d3e1e7465e68e1d3e1e7465e664e7de1d3e1e7465e68e1d3e1e7465e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)