To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鬩幢ス、鞜橸スョ鬩滂スャ鬩幢ス、鞜橸スョ頷 1110100110101001100110111110111110111101101001001110100011011111100111101110111110111101101011101110100110101001100111111110111110111101101011001110100110101001100110111110111110111101101001001110100011011111100111101110111110111101101011101110100011110101 e9a99befbda4e8df9eefbdaee9a99fefbdace9a99befbda4e8df9eefbdaee8f5
EUC-JP 鬩幢ス、鞜橸スョ鬩滂スャ鬩幢ス、鞜橸スョ頷 111100101010101111010110111100011000111010111101100011101010010011110000111000011101110011110001100011101011110110001110101011101111001010101011110111101111000110001110101111011000111010101100111100101010101111010110111100011000111010111101100011101010010011110000111000011101110011110001100011101011110110001110101011101111000011110111 f2abd6f18ebd8ea4f0e1dcf18ebd8eaef2abdef18ebd8eacf2abd6f18ebd8ea4f0e1dcf18ebd8eaef0f7
UTF-8 鬩幢ス、鞜橸スョ鬩滂スャ鬩幢ス、鞜橸スョ頷 111010011010110010101001111001011011100110100010111011111011110110111101111011111011110110100100111010011001111010011100111001101010100110111000111011111011110110111101111011111011110110101110111010011010110010101001111001101011101110000010111011111011110110111101111011111011110110101100111010011010110010101001111001011011100110100010111011111011110110111101111011111011110110100100111010011001111010011100111001101010100110111000111011111011110110111101111011111011110110101110111010011010000010110111 e9aca9e5b9a2efbdbdefbda4e99e9ce6a9b8efbdbdefbdaee9aca9e6bb82efbdbdefbdace9aca9e5b9a2efbdbdefbda4e99e9ce6a9b8efbdbdefbdaee9a0b7
UHC ?幢???????滂???幢??????? 001111111101001111010011001111110011111100111111001111110011111100111111001111111101101110110101001111110011111100111111110100111101001100111111001111110011111100111111001111110011111100111111 3fd3d33f3f3f3f3f3f3fdbb53f3f3fd3d33f3f3f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)