To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 硝羮ヲ硝ォ湿辞n}硝羮ヲ硝ォ湿辞n{^ 100011111100100111100011101111001010011010001111110010011010101110001110101111001111001011001100100011101010101101101110011111011000111111001001111000111011110010100110100011111100100110101011100011101011110011110010110011001000111010101011011011100111101101011110 8fc9e3bca68fc9ab8ebcf2cc8eab6e7d8fc9e3bca68fc9ab8ebcf2cc8eab6e7b5e
EUC-JP 硝羮ヲ硝ォ湿?辞n}硝羮ヲ硝ォ湿?辞n{^ 1011111011001011111001101011111010001110101001101011111011001011100011101010101110111100101111100011111110111100101011010110111001111101101111101100101111100110101111101000111010100110101111101100101110001110101010111011110010111110001111111011110010101101011011100111101101011110 becbe6be8ea6becb8eabbcbe3fbcad6e7dbecbe6be8ea6becb8eabbcbe3fbcad6e7b5e
UTF-8 硝羮ヲ硝ォ湿辞n}硝羮ヲ硝ォ湿辞n{^ 1110011110100001100111011110011110111110101011101110111110111101101001101110011110100001100111011110111110111101101010111110011010111001101111111110111010001000100000111110100010111110100111100110111001111101111001111010000110011101111001111011111010101110111011111011110110100110111001111010000110011101111011111011110110101011111001101011100110111111111011101000100010000011111010001011111010011110011011100111101101011110 e7a19de7beaeefbda6e7a19defbdabe6b9bfee8883e8be9e6e7de7a19de7beaeefbda6e7a19defbdabe6b9bfee8883e8be9e6e7b5e
UHC 硝??硝????n}硝??硝????n{^ 11110101101001100011111100111111111101011010011000111111001111110011111100111111011011100111110111110101101001100011111100111111111101011010011000111111001111110011111100111111011011100111101101011110 f5a63f3ff5a63f3f3f3f6e7df5a63f3ff5a63f3f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)