Which bit string does a character (or a short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 閾ェ豐サ閾ェ豐サ辷セ隴亙閾ェ豐サ閾ェ豐サ辷セ隴亙B 111010001000011110101010111001101011001010111011111010001000011110101010111001101011001010111011111001111000100010111110111010001010110110011000011010011110100010000111101010101110011010110010101110111110100010000111101010101110011010110010101110111110011110001000101111101110100010101101100110000110100101000010 e887aae6b2bbe887aae6b2bbe788bee8ad9869e887aae6b2bbe887aae6b2bbe788bee8ad986942
EUC-JP 閾ェ豐サ閾ェ豐サ辷セ隴亙閾ェ豐サ閾ェ豐サ辷セ隴亙B 11101111111001111000111010101010111011001011010010001110101110111110111111100111100011101010101011101100101101001000111010111011111011011110100010001110101111101111000010101111110011111100101011101111111001111000111010101010111011001011010010001110101110111110111111100111100011101010101011101100101101001000111010111011111011011110100010001110101111101111000010101111110011111100101001000010 efe78eaaecb48ebbefe78eaaecb48ebbede88ebef0afcfcaefe78eaaecb48ebbefe78eaaecb48ebbede88ebef0afcfca42
UTF-8 閾ェ豐サ閾ェ豐サ辷セ隴亙閾ェ豐サ閾ェ豐サ辷セ隴亙B 11101001100101101011111011101111101111011010101011101000101100011001000011101111101111011011101111101001100101101011111011101111101111011010101011101000101100011001000011101111101111011011101111101000101111101011011111101111101111011011111011101001100110101011010011100100101110101001100111101001100101101011111011101111101111011010101011101000101100011001000011101111101111011011101111101001100101101011111011101111101111011010101011101000101100011001000011101111101111011011101111101000101111101011011111101111101111011011111011101001100110101011010011100100101110101001100101000010 e996beefbdaae8b190efbdbbe996beefbdaae8b190efbdbbe8beb7efbdbee99ab4e4ba99e996beefbdaae8b190efbdbbe996beefbdaae8b190efbdbbe8beb7efbdbee99ab4e4ba9942
UHC ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
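
The conversion shown in the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the tool's actual implementation: the function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents of the charset labels above. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is consistent with the ISO-8859-1 and UHC rows in the table.

```python
def encode_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "\u81eaB"  # the kanji U+81EA followed by ASCII "B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

In every charset listed here the trailing ASCII `B` encodes to the single byte `42` (`01000010`), which is why each row of the table ends the same way; only the bytes for the non-ASCII characters differ.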