What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??A????????????A 00111111001111110100000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000001 3f3f413f3f3f3f3f3f3f3f3f3f3f3f41
SJIS-WIN テシAテ姪ヲツ篠クテ愿堕偲淌シA 11000011101111000100000111000011100101101100001110100110110000101000111011000010101110001100001110011100110000111001000111000010100011101100001110011111110000111011110001000001 c3bc41c396c3a6c28ec2b8c39cc391c28ec39fc3bc41
EUC-JP テシAテ姪ヲツ篠クテ愿堕偲淌シA 100011101100001110001110101111000100000110001110110000111100110011000101100011101010011010001110110000101011110011000100100011101011100010001110110000111101100011000101110000101100010010111100110001011101111011000101100011101011110001000001 8ec38ebc418ec3ccc58ea68ec2bcc48eb88ec3d8c5c2c4bcc5dec58ebc41
UTF-8 テシAテ姪ヲツ篠クテ愿堕偲淌シA 1110111110111110100000111110111110111101101111000100000111101111101111101000001111100101101001111010101011101111101111011010011011101111101111101000001011100111101011111010000011101111101111011011100011101111101111101000001111100110100001001011111111100101101000001001010111100101100000011011001011100110101101111000110011101111101111011011110001000001 efbe83efbdbc41efbe83e5a7aaefbda6efbe82e7afa0efbdb8efbe83e684bfe5a095e581b2e6b78cefbdbc41
UHC ??A?姪??篠??愿????A 00111111001111110100000100111111111100101110101100111111001111111110000111000110001111110011111111101010101101000011111100111111001111110011111101000001 3f3f413ff2eb3f3fe1c63f3feab43f3f3f3f41

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
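
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation. It assumes that Python's codecs `cp932` and `cp949` correspond to the page's SJIS-WIN and UHC, and that characters with no mapping in a charset are replaced by `?` (0x3f), as the table rows suggest. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    # First three characters of the sample input above:
    # half-width katakana U+FF83, U+FF7C, then "A".
    for label, bits, hex_str in encode_table("\uff83\uff7cA"):
        print(label, bits, hex_str)
```

For this three-character input, each row should begin the same way as the corresponding row in the table above, for example `c3bc41` for SJIS-WIN and `8ec38ebc41` for EUC-JP.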