To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 若??踰??楡?+烏k?誼??碎??筌 10001110111000010011111100111111111001101111101000111111001111111001111010111110001111111000000101111011100010010100011110000010100010110011111110001011011000100011111100111111111000011110101000111111001111111110001010100011 8ee13f3fe6fa3f3f9ebe3f817b8947828b3f8b623f3fe1ea3f3fe2a3
EUC-JP 若??踰??楡?+烏k?誼??碎??筌 10111100111000110011111100111111111011001111110000111111001111111101110011000000001111111010000111011100101100011010100010100011111010110011111110110101110000110011111100111111111000101110110000111111001111111110010010100101 bce33f3fecfc3f3fdcc03fa1dcb1a8a3eb3fb5c33f3fe2ec3f3fe4a5
UTF-8 若뽧꺀踰ㅿ㎘楡⑹+烏k맧誼욥윹碎ㅼ죦筌 111010001000101110100101111010111011110110100111111010101011101010000000111010001011100010110000111000111000010110111111111000111000111010011000111001101010010110100001111000101001000110111001111011111011110010001011111001111000001110001111111011111011110110001011111010111010011110100111111010001010101010111100111011001001101010100101111011001001110010111001111001111010001010001110111000111000010110111100111011001010001110100110111001111010110110001100 e88ba5ebbda7eaba80e8b8b0e385bfe38e98e6a5a1e291b9efbc8be7838fefbd8beba7a7e8aabcec9aa5ec9cb9e7a28ee385bceca3a6e7ad8c
UHC 若뽧꺀踰ㅿ㎘楡⑹+烏k맧誼욥윹碎ㅼ죦筌 1110010110110100100101101110001110000011101010011110101110110010101001001110111110100111101001011110101011111000101010011110110010100011101010111110100010100001101000111110101110010000101100001110101111111110101111111110100110011111101100111110000111101111101001001110110010100001100000011110111110100111 e5b496e383a9ebb2a4efa7a5eaf8a9eca3abe8a1a3eb90b0ebfebfe99fb3e1efa4eca181efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)