What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 茫∞嘗nR茫∞嘗n^[茫∞嘗nR茫∞嘗n^[^ 1110010010101001100000011000011110001111101001100110111001010010111001001010100110000001100001111000111110100110011011100101111001011011111001001010100110000001100001111000111110100110011011100101001011100100101010011000000110000111100011111010011001101110010111100101101101011110 e4a981878fa66e52e4a981878fa66e5e5be4a981878fa66e52e4a981878fa66e5e5b5e
EUC-JP 茫∞嘗nR茫∞嘗n^[茫∞嘗nR茫∞嘗n^[^ 1110100010101011101000011110011110111110101010000110111001010010111010001010101110100001111001111011111010101000011011100101111001011011111010001010101110100001111001111011111010101000011011100101001011101000101010111010000111100111101111101010100001101110010111100101101101011110 e8aba1e7bea86e52e8aba1e7bea86e5e5be8aba1e7bea86e52e8aba1e7bea86e5e5b5e
UTF-8 茫∞嘗nR茫∞嘗n^[茫∞嘗nR茫∞嘗n^[^ 1110100010001100101010111110001010001000100111101110010110011000100101110110111001010010111010001000110010101011111000101000100010011110111001011001100010010111011011100101111001011011111010001000110010101011111000101000100010011110111001011001100010010111011011100101001011101000100011001010101111100010100010001001111011100101100110001001011101101110010111100101101101011110 e88cabe2889ee598976e52e88cabe2889ee598976e5e5be88cabe2889ee598976e52e88cabe2889ee598976e5e5b5e
UHC 茫∞嘗nR茫∞嘗n^[茫∞嘗nR茫∞嘗n^[^ 1101100011010100101000011100010011011111110001000110111001010010110110001101010010100001110001001101111111000100011011100101111001011011110110001101010010100001110001001101111111000100011011100101001011011000110101001010000111000100110111111100010001101110010111100101101101011110 d8d4a1c4dfc46e52d8d4a1c4dfc46e5e5bd8d4a1c4dfc46e52d8d4a1c4dfc46e5e5b5e
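The conversion in the table above can be approximated with a short Python sketch. This is not the page's own converter, which is not shown; the codec names in `CODECS` are assumptions about which Python codecs correspond to each charset label (`cp932` for SJIS-WIN, `cp949` for UHC), and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte is rendered as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hexed) in convert("∞nR").items():
        print(name, bits, hexed)
```

Running the script prints one line per charset, in the same order as the table: the charset label, the binary string, and the hexadecimal string.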

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).