To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????T 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f54
SJIS-WIN 偲ヲト自偲、ト治偲、トコナヲト蒔偲、ト竺偲、ト七T 1000111011000011101001101100010010001110101010011000111011000011101001001100010010001110101000011000111011000011101001001100010010111010110001011010011011000100100011101010101010001110110000111010010011000100100011101011000110001110110000111010010011000100100011101011010101010100 8ec3a6c48ea98ec3a4c48ea18ec3a4c4bac5a6c48eaa8ec3a4c48eb18ec3a4c48eb554
EUC-JP 偲ヲト自偲、ト治偲、トコナヲト蒔偲、ト竺偲、ト七T 10111100110001011000111010100110100011101100010010111100101010111011110011000101100011101010010010001110110001001011110010100011101111001100010110001110101001001000111011000100100011101011101010001110110001011000111010100110100011101100010010111100101011001011110011000101100011101010010010001110110001001011110010110011101111001100010110001110101001001000111011000100101111001011011101010100 bcc58ea68ec4bcabbcc58ea48ec4bca3bcc58ea48ec48eba8ec58ea68ec4bcacbcc58ea48ec4bcb3bcc58ea48ec4bcb754
UTF-8 偲ヲト自偲、ト治偲、トコナヲト蒔偲、ト竺偲、ト七T 11100101100000011011001011101111101111011010011011101111101111101000010011101000100001111010101011100101100000011011001011101111101111011010010011101111101111101000010011100110101100101011101111100101100000011011001011101111101111011010010011101111101111101000010011101111101111011011101011101111101111101000010111101111101111011010011011101111101111101000010011101000100100101001010011100101100000011011001011101111101111011010010011101111101111101000010011100111101010111011101011100101100000011011001011101111101111011010010011101111101111101000010011100100101110001000001101010100 e581b2efbda6efbe84e887aae581b2efbda4efbe84e6b2bbe581b2efbda4efbe84efbdbaefbe85efbda6efbe84e89294e581b2efbda4efbe84e7abbae581b2efbda4efbe84e4b88354
UHC ???自???治???????蒔???竺???七T 001111110011111100111111111011011011101100111111001111110011111111110110101111010011111100111111001111110011111100111111001111110011111111100011110010000011111100111111001111111111010111100111001111110011111100111111111101101101001001010100 3f3f3fedbb3f3f3ff6bd3f3f3f3f3f3f3fe3c83f3f3ff5e73f3f3ff6d254

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)