What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 讓ス譚台サ冶ェー驕懆カウ閼ア荳ケ譚醍カサB 11100110101010001011110111100110100111011001000111100100101110111001011011101000101010101011000011101001100000011001110011101000101101101011001111101000100001001011000111100100101110001011100111100110100111011001000111100111101101101011101101000010 e6a8bde69d91e4bb96e8aab0e9819ce8b6b3e884b1e4b8b9e69d91e7b6bb42
EUC-JP 讓ス譚台サ冶ェー驕懆カウ閼ア荳ケ譚醍カサB 1110110010101010100011101011110111101011111111011100001011100110100011101011101111001100111010101000111010101010100011101011000011110001111000011101100011101010100011101011011010001110101100111110111111100100100011101011000111101000101110101000111010111001111010111111110111000010111010011000111010110110100011101011101101000010 ecaa8ebdebfdc2e68ebbccea8eaa8eb0f1e1d8ea8eb68eb3efe48eb1e8ba8eb9ebfdc2e98eb68ebb42
UTF-8 讓ス譚台サ冶ェー驕懆カウ閼ア荳ケ譚醍カサB 11101000101011101001001111101111101111011011110111101000101011011001101011100101100011111011000011101111101111011011101111100101100001101011011011101111101111011010101011101111101111011011000011101001101010011001010111100110100001111000011011101111101111011011011011101111101111011011001111101001100101101011110011101111101111011011000111101000100011011011001111101111101111011011100111101000101011011001101011101001100001101000110111101111101111011011011011101111101111011011101101000010 e8ae93efbdbde8ad9ae58fb0efbdbbe586b6efbdaaefbdb0e9a995e68786efbdb6efbdb3e996bcefbdb1e88db3efbdb9e8ad9ae9868defbdb6efbdbb42
UHC 讓?譚台?冶??驕???閼?荳?譚醍??B 111001011101001100111111110100111100100111110111101110110011111111100101101001110011111100111111110011101111011000111111001111110011111111100100110110010011111111010100111001010011111111010011110010011111000010110101001111110011111101000010 e5d33fd3c9f7bb3fe5a73f3fcef63f3f3fe4d93fd4e53fd3c9f0b53f3f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
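
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation, which is not shown here. The mapping from the page's charset labels to Python codec names (`CHARSETS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the ISO-8859-1 and UHC rows display.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Build one (bits, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the single character `B`, every charset in the table yields the same byte, `42` (binary `01000010`), because all five are ASCII-compatible for that code point. Differences appear only for non-ASCII input.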