To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遶ェ陲匁錐閼ア蟆頑拷遶ェ陲匁錐閼ア蟆頑拷B 11100111101010111010101011101000101000101001011011100110100100001000110111101000100001001011000111100101101100001000101011100110100011011000100111100111101010111010101011101000101000101001011011100110100100001000110111101000100001001011000111100101101100001000101011100110100011011000100101000010 e7abaae8a296e6908de884b1e5b08ae68d89e7abaae8a296e6908de884b1e5b08ae68d8942
EUC-JP 遶ェ陲匁錐閼ア蟆頑拷遶ェ陲匁錐閼ア蟆頑拷B 1110111010101101100011101010101011110000101001001100110011101000101111111110110111101111111001001000111010110001111010101011001010110100111010001011100111101001111011101010110110001110101010101111000010100100110011001110100010111111111011011110111111100100100011101011000111101010101100101011010011101000101110011110100101000010 eead8eaaf0a4cce8bfedefe48eb1eab2b4e8b9e9eead8eaaf0a4cce8bfedefe48eb1eab2b4e8b9e942
UTF-8 遶ェ陲匁錐閼ア蟆頑拷遶ェ陲匁錐閼ア蟆頑拷B 11101001100000011011011011101111101111011010101011101001100110011011001011100101100011001000000111101001100011001001000011101001100101101011110011101111101111011011000111101000100111111000011011101001101000001001000111100110100010111011011111101001100000011011011011101111101111011010101011101001100110011011001011100101100011001000000111101001100011001001000011101001100101101011110011101111101111011011000111101000100111111000011011101001101000001001000111100110100010111011011101000010 e981b6efbdaae999b2e58c81e98c90e996bcefbdb1e89f86e9a091e68bb7e981b6efbdaae999b2e58c81e98c90e996bcefbdb1e89f86e9a091e68bb742
UHC ????錐閼??頑拷????錐閼??頑拷B 0011111100111111001111110011111111110101110111101110010011011001001111110011111111101000110101111100110110111000001111110011111100111111001111111111010111011110111001001101100100111111001111111110100011010111110011011011100001000010 3f3f3f3ff5dee4d93f3fe8d7cdb83f3f3f3ff5dee4d93f3fe8d7cdb842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)