Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鬘ォ驛∝エ溯」晢スセ鬮エ閧イ鄕呵ォ橸スセB 11101001101000011010101111101001100000111000000111100101101101001001111111101000101000111001110111101111101111011011111011101001101010111011010011101000100000101011001011111011101110001001100111101000101010111001111011101111101111011011111001000010 e9a1abe98381e5b49fe8a39defbdbee9abb4e882b2fbb899e8ab9eefbdbe42
EUC-JP 鬘ォ驛∝エ溯」晢スセ鬮エ閧イ?呵ォ橸スセB 11110010101000111000111010101011111100011110001110100010111001111000111010110100110111101110101010001110101000111101101011110001100011101011110110001110101111101111001010101101100011101011010011101111111000101000111010110010001111111101001011101010100011101010101111011100111100011000111010111101100011101011111001000010 f2a38eabf1e3a2e78eb4deea8ea3daf18ebd8ebef2ad8eb4efe28eb23fd2ea8eabdcf18ebd8ebe42
UTF-8 鬘ォ驛∝エ溯」晢スセ鬮エ閧イ鄕呵ォ橸スセB 11101001101011001001100011101111101111011010101111101001101010011001101111100010100010001001110111101111101111011011010011100110101110101010111111101111101111011010001111100110100110011010001011101111101111011011110111101111101111011011111011101001101011001010111011101111101111011011010011101001100101101010011111101111101111011011001011101001100001001001010111100101100100011011010111101111101111011010101111100110101010011011100011101111101111011011110111101111101111011011111001000010 e9ac98efbdabe9a99be2889defbdb4e6baafefbda3e699a2efbdbdefbdbee9acaeefbdb4e996a7efbdb2e98495e591b5efbdabe6a9b8efbdbdefbdbe42
UHC ??驛∝?溯????????鄕呵????B 0011111100111111111001101011111010100001111100000011111111100001101111010011111100111111001111110011111100111111001111110011111100111111111110101100000111001010101001110011111100111111001111110011111101000010 3f3fe6bea1f03fe1bd3f3f3f3f3f3f3f3ffac1caa73f3f3f3f42

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
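
A table like the one above can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, so individual byte sequences may differ from the page's output for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which is how the runs of `3f` in the ISO-8859-1 and UHC rows would arise.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あB").items():
        print(label, bits, hexa)
```

For example, `encode_row("B", "utf-8")` returns `("01000010", "42")`, matching the trailing `42` in every row of the table, since `B` is plain ASCII and is encoded identically in all five charsets.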