To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 讓ス譚台サ冶ェー鞜懆カウ鈔ア荳ケ襠醍カサB 11100110101010001011110111100110100111011001000111100100101110111001011011101000101010101011000011101000110111111001110011101000101101101011001111100111111000101011000111100100101110001011100111100101111110111001000111100111101101101011101101000010 e6a8bde69d91e4bb96e8aab0e8df9ce8b6b3e7e2b1e4b8b9e5fb91e7b6bb42
EUC-JP 讓ス譚台サ冶ェー鞜懆カウ鈔ア荳ケ襠醍カサB 1110110010101010100011101011110111101011111111011100001011100110100011101011101111001100111010101000111010101010100011101011000011110000111000011101100011101010100011101011011010001110101100111110111011100100100011101011000111101000101110101000111010111001111010101111110111000010111010011000111010110110100011101011101101000010 ecaa8ebdebfdc2e68ebbccea8eaa8eb0f0e1d8ea8eb68eb3eee48eb1e8ba8eb9eafdc2e98eb68ebb42
UTF-8 讓ス譚台サ冶ェー鞜懆カウ鈔ア荳ケ襠醍カサB 11101000101011101001001111101111101111011011110111101000101011011001101011100101100011111011000011101111101111011011101111100101100001101011011011101111101111011010101011101111101111011011000011101001100111101001110011100110100001111000011011101111101111011011011011101111101111011011001111101001100010001001010011101111101111011011000111101000100011011011001111101111101111011011100111101000101001011010000011101001100001101000110111101111101111011011011011101111101111011011101101000010 e8ae93efbdbde8ad9ae58fb0efbdbbe586b6efbdaaefbdb0e99e9ce68786efbdb6efbdb3e98894efbdb1e88db3efbdb9e8a5a0e9868defbdb6efbdbb42
UHC 讓?譚台?冶????????荳??醍??B 111001011101001100111111110100111100100111110111101110110011111111100101101001110011111100111111001111110011111100111111001111110011111100111111110101001110010100111111001111111111000010110101001111110011111101000010 e5d33fd3c9f7bb3fe5a73f3f3f3f3f3f3f3fd4e53f3ff0b53f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)