To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????S????????????????uB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111010101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f533f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7542
SJIS-WIN 偲、トセナ、ト蒔偲、ト自偲ヲト漆S偲、トセナ、ト蒔偲、ト璽偲ヲト漆uB 1000111011000011101001001100010010111110110001011010010011000100100011101010101010001110110000111010010011000100100011101010100110001110110000111010011011000100100011101011110101010011100011101100001110100100110001001011111011000101101001001100010010001110101010101000111011000011101001001100010010001110101000111000111011000011101001101100010010001110101111010111010101000010 8ec3a4c4bec5a4c48eaa8ec3a4c48ea98ec3a6c48ebd538ec3a4c4bec5a4c48eaa8ec3a4c48ea38ec3a6c48ebd7542
EUC-JP 偲、トセナ、ト蒔偲、ト自偲ヲト漆S偲、トセナ、ト蒔偲、ト璽偲ヲト漆uB 10111100110001011000111010100100100011101100010010001110101111101000111011000101100011101010010010001110110001001011110010101100101111001100010110001110101001001000111011000100101111001010101110111100110001011000111010100110100011101100010010111100101111110101001110111100110001011000111010100100100011101100010010001110101111101000111011000101100011101010010010001110110001001011110010101100101111001100010110001110101001001000111011000100101111001010010110111100110001011000111010100110100011101100010010111100101111110111010101000010 bcc58ea48ec48ebe8ec58ea48ec4bcacbcc58ea48ec4bcabbcc58ea68ec4bcbf53bcc58ea48ec48ebe8ec58ea48ec4bcacbcc58ea48ec4bca5bcc58ea68ec4bcbf7542
UTF-8 偲、トセナ、ト蒔偲、ト自偲ヲト漆S偲、トセナ、ト蒔偲、ト璽偲ヲト漆uB 111001011000000110110010111011111011110110100100111011111011111010000100111011111011110110111110111011111011111010000101111011111011110110100100111011111011111010000100111010001001001010010100111001011000000110110010111011111011110110100100111011111011111010000100111010001000011110101010111001011000000110110010111011111011110110100110111011111011111010000100111001101011110010000110010100111110010110000001101100101110111110111101101001001110111110111110100001001110111110111101101111101110111110111110100001011110111110111101101001001110111110111110100001001110100010010010100101001110010110000001101100101110111110111101101001001110111110111110100001001110011110010010101111011110010110000001101100101110111110111101101001101110111110111110100001001110011010111100100001100111010101000010 e581b2efbda4efbe84efbdbeefbe85efbda4efbe84e89294e581b2efbda4efbe84e887aae581b2efbda6efbe84e6bc8653e581b2efbda4efbe84efbdbeefbe85efbda4efbe84e89294e581b2efbda4efbe84e792bde581b2efbda6efbe84e6bc867542
UHC ???????蒔???自???漆S???????蒔???璽???漆uB 0011111100111111001111110011111100111111001111110011111111100011110010000011111100111111001111111110110110111011001111110011111100111111111101101101010001010011001111110011111100111111001111110011111100111111001111111110001111001000001111110011111100111111110111111101111000111111001111110011111111110110110101000111010101000010 3f3f3f3f3f3f3fe3c83f3f3fedbb3f3f3ff6d4533f3f3f3f3f3f3fe3c83f3f3fdfde3f3f3ff6d47542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)