To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN ??娩nR??娩n^[??娩nR??娩n^[^ 001111110011111110010101110110000110111001010010001111110011111110010101110110000110111001011110010110110011111100111111100101011101100001101110010100100011111100111111100101011101100001101110010111100101101101011110 3f3f95d86e523f3f95d86e5e5b3f3f95d86e523f3f95d86e5e5b5e
EUC-JP ?琁娩nR?琁娩n^[?琁娩nR?琁娩n^[^ 0011111110001111110011001010001111001010110110100110111001010010001111111000111111001100101000111100101011011010011011100101111001011011001111111000111111001100101000111100101011011010011011100101001000111111100011111100110010100011110010101101101001101110010111100101101101011110 3f8fcca3cada6e523f8fcca3cada6e5e5b3f8fcca3cada6e523f8fcca3cada6e5e5b5e
UTF-8 겠琁娩nR겠琁娩n^[겠琁娩nR겠琁娩n^[^ 1110101010110010101000001110011110010000100000011110010110101000101010010110111001010010111010101011001010100000111001111001000010000001111001011010100010101001011011100101111001011011111010101011001010100000111001111001000010000001111001011010100010101001011011100101001011101010101100101010000011100111100100001000000111100101101010001010100101101110010111100101101101011110 eab2a0e79081e5a8a96e52eab2a0e79081e5a8a96e5e5beab2a0e79081e5a8a96e52eab2a0e79081e5a8a96e5e5b5e
UHC 겠琁娩nR겠琁娩n^[겠琁娩nR겠琁娩n^[^ 1011000011011010111000001100010011011000101101000110111001010010101100001101101011100000110001001101100010110100011011100101111001011011101100001101101011100000110001001101100010110100011011100101001010110000110110101110000011000100110110001011010001101110010111100101101101011110 b0dae0c4d8b46e52b0dae0c4d8b46e5e5bb0dae0c4d8b46e52b0dae0c4d8b46e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)