Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 鰲??藉?R鰲??藉?^[鰲??藉?R鰲??藉?^[^ 1110100111100000001111110011111111100101010100110011111101010010111010011110000000111111001111111110010101010011001111110101111001011011111010011110000000111111001111111110010101010011001111110101001011101001111000000011111100111111111001010101001100111111010111100101101101011110 e9e03f3fe5533f52e9e03f3fe5533f5e5be9e03f3fe5533f52e9e03f3fe5533f5e5b5e
EUC-JP 鰲??藉?R鰲??藉?^[鰲??藉?R鰲??藉?^[^ 1111001011100010001111110011111111101001101101000011111101010010111100101110001000111111001111111110100110110100001111110101111001011011111100101110001000111111001111111110100110110100001111110101001011110010111000100011111100111111111010011011010000111111010111100101101101011110 f2e23f3fe9b43f52f2e23f3fe9b43f5e5bf2e23f3fe9b43f52f2e23f3fe9b43f5e5b5e
UTF-8 鰲쑈뤌藉쐬R鰲쑈뤌藉쐬^[鰲쑈뤌藉쐬R鰲쑈뤌藉쐬^[^ 11101001101100001011001011101100100100011000100011101011101001001000110011101000100101111000100111101100100100001010110001010010111010011011000010110010111011001001000110001000111010111010010010001100111010001001011110001001111011001001000010101100010111100101101111101001101100001011001011101100100100011000100011101011101001001000110011101000100101111000100111101100100100001010110001010010111010011011000010110010111011001001000110001000111010111010010010001100111010001001011110001001111011001001000010101100010111100101101101011110 e9b0b2ec9188eba48ce89789ec90ac52e9b0b2ec9188eba48ce89789ec90ac5e5be9b0b2ec9188eba48ce89789ec90ac52e9b0b2ec9188eba48ce89789ec90ac5e5b5e
UHC 鰲쑈뤌藉쐬R鰲쑈뤌藉쐬^[鰲쑈뤌藉쐬R鰲쑈뤌藉쐬^[^ 1110100010100111101111101010010010001111101111001110110110111110101111011111110101010010111010001010011110111110101001001000111110111100111011011011111010111101111111010101111001011011111010001010011110111110101001001000111110111100111011011011111010111101111111010101001011101000101001111011111010100100100011111011110011101101101111101011110111111101010111100101101101011110 e8a7bea48fbcedbebdfd52e8a7bea48fbcedbebdfd5e5be8a7bea48fbcedbebdfd52e8a7bea48fbcedbebdfd5e5b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
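
The kind of conversion shown in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which resembles the `3f` runs in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's codec for Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, character, binary, hexadecimal.
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, "あ", bits, hexstr)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table above.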