To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN ?烟關nR?烟關n^[?烟關nR?烟關n^[^ 00111111111000000111110011101000100100000110111001010010001111111110000001111100111010001001000001101110010111100101101100111111111000000111110011101000100100000110111001010010001111111110000001111100111010001001000001101110010111100101101101011110 3fe07ce8906e523fe07ce8906e5e5b3fe07ce8906e523fe07ce8906e5e5b5e
EUC-JP ?烟關nR?烟關n^[?烟關nR?烟關n^[^ 00111111110111111101110111101111111100000110111001010010001111111101111111011101111011111111000001101110010111100101101100111111110111111101110111101111111100000110111001010010001111111101111111011101111011111111000001101110010111100101101101011110 3fdfddeff06e523fdfddeff06e5e5b3fdfddeff06e523fdfddeff06e5e5b5e
UTF-8 뤗烟關nR뤗烟關n^[뤗烟關nR뤗烟關n^[^ 1110101110100100100101111110011110000011100111111110100110010111100111000110111001010010111010111010010010010111111001111000001110011111111010011001011110011100011011100101111001011011111010111010010010010111111001111000001110011111111010011001011110011100011011100101001011101011101001001001011111100111100000111001111111101001100101111001110001101110010111100101101101011110 eba497e7839fe9979c6e52eba497e7839fe9979c6e5e5beba497e7839fe9979c6e52eba497e7839fe9979c6e5e5b5e
UHC 뤗烟關nR뤗烟關n^[뤗烟關nR뤗烟關n^[^ 1000111111000111111001101101001111001110101111000110111001010010100011111100011111100110110100111100111010111100011011100101111001011011100011111100011111100110110100111100111010111100011011100101001010001111110001111110011011010011110011101011110001101110010111100101101101011110 8fc7e6d3cebc6e528fc7e6d3cebc6e5e5b8fc7e6d3cebc6e528fc7e6d3cebc6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)