To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 棕??嶝??翁烽??攻渦嶝??翁烽?肌 1001111010100001001111110011111110011011110100010011111100111111100010011010010111100000100000100011111100111111100011010101010110001001010100011001101111010001001111110011111110001001101001011110000010000010001111111001010010100111 9ea13f3f9bd13f3f89a5e0823f3f8d5589519bd13f3f89a5e0823f94a7
EUC-JP 棕??嶝??翁烽??攻渦嶝??翁烽?肌 1101110010100011001111110011111111010110110100110011111100111111101100101010011111011111111000100011111100111111101110011011011010110001101100101101011011010011001111110011111110110010101001111101111111100010001111111100100010101001 dca33f3fd6d33f3fb2a7dfe23f3fb9b6b1b2d6d33f3fb2a7dfe23fc8a9
UTF-8 棕흑렯嶝렰렰翁烽렑陋攻渦嶝렰렰翁烽렑肌 111001101010001110010101111011011001110110010001111010111010000010101111111001011011011010011101111010111010000010110000111010111010000010110000111001111011111110000001111001111000001110111101111010111010000010010001111011111010010110010001111001101001010010111011111001101011100010100110111001011011011010011101111010111010000010110000111010111010000010110000111001111011111110000001111001111000001110111101111010111010000010010001111010001000001010001100 e6a395ed9d91eba0afe5b69deba0b0eba0b0e7bf81e783bdeba091efa591e694bbe6b8a6e5b69deba0b0eba0b0e7bf81e783bdeba091e8828c
UHC 棕흑렯嶝렰렰翁烽렑陋攻渦嶝렰렰翁烽렑肌 1111000011110111110010001110011010001110101111001101010011110001100011101011110110001110101111011110100010111010110111001110101110001110101001101101001011101011110011011111010011101000101111101101010011110001100011101011110110001110101111011110100010111010110111001110101110001110101001101101000110111111 f0f7c8e68ebcd4f18ebd8ebde8badceb8ea6d2ebcdf4e8bed4f18ebd8ebde8badceb8ea6d1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)