To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN ?鏞餓nR?鏞餓n^[?鏞餓nR?鏞餓n^[^ 00111111111110111110001110001001111011000110111001010010001111111111101111100011100010011110110001101110010111100101101100111111111110111110001110001001111011000110111001010010001111111111101111100011100010011110110001101110010111100101101101011110 3ffbe389ec6e523ffbe389ec6e5e5b3ffbe389ec6e523ffbe389ec6e5e5b5e
EUC-JP ?鏞餓nR?鏞餓n^[?鏞餓nR?鏞餓n^[^ 0011111110001111111001011100100110110010111011100110111001010010001111111000111111100101110010011011001011101110011011100101111001011011001111111000111111100101110010011011001011101110011011100101001000111111100011111110010111001001101100101110111001101110010111100101101101011110 3f8fe5c9b2ee6e523f8fe5c9b2ee6e5e5b3f8fe5c9b2ee6e523f8fe5c9b2ee6e5e5b5e
UTF-8 뤳鏞餓nR뤳鏞餓n^[뤳鏞餓nR뤳鏞餓n^[^ 1110101110100100101100111110100110001111100111101110100110100100100100110110111001010010111010111010010010110011111010011000111110011110111010011010010010010011011011100101111001011011111010111010010010110011111010011000111110011110111010011010010010010011011011100101001011101011101001001011001111101001100011111001111011101001101001001001001101101110010111100101101101011110 eba4b3e98f9ee9a4936e52eba4b3e98f9ee9a4936e5e5beba4b3e98f9ee9a4936e52eba4b3e98f9ee9a4936e5e5b5e
UHC 뤳鏞餓nR뤳鏞餓n^[뤳鏞餓nR뤳鏞餓n^[^ 1000111111100001111010011100101111100100101110110110111001010010100011111110000111101001110010111110010010111011011011100101111001011011100011111110000111101001110010111110010010111011011011100101001010001111111000011110100111001011111001001011101101101110010111100101101101011110 8fe1e9cbe4bb6e528fe1e9cbe4bb6e5e5b8fe1e9cbe4bb6e528fe1e9cbe4bb6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)