To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????±? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f
SJIS-WIN 褥?∥意??儀??語??異??擬????±泣 11100101111100010011111110000001011000011000100011010011001111110011111110001011010101100011111100111111100011001110101000111111001111111000100011011001001111110011111110001011010110110011111100111111001111110011111110000001011111011000101110000011 e5f13f816188d33f3f8b563f3f8cea3f3f88d93f3f8b5b3f3f3f3f817d8b83
EUC-JP 褥?‖意??儀??語??異??擬????±泣 11101010111100110011111110100001110000101011000011010101001111110011111110110101101101110011111100111111101110001110110000111111001111111011000011011011001111110011111110110101101111000011111100111111001111110011111110100001110111101011010111100011 eaf33fa1c2b0d53f3fb5b73f3fb8ec3f3fb0db3f3fb5bc3f3f3f3fa1deb5e3
UTF-8 褥띕∥意덌쭏儀숈춳語ⓦ끉異욥뒽擬쒕엮濾낆±泣 1110100010100100101001011110101110011101100101011110001010001000101001011110011010000100100011111110101110001101100011001110110010101101100011111110010110000100100000001110110010001000100010001110110010110110101100111110100010101010100111101110001010010011101001101110101110000001100010011110011110010101101100001110110010011010101001011110101110010010101111011110011010010011101011001110110010010010100101011110110010010111101011101110111110100110100001001110101110000010100001101100001010110001111001101011001110100011 e8a4a5eb9d95e288a5e6848feb8d8cecad8fe58480ec8888ecb6b3e8aa9ee293a6eb8189e795b0ec9aa5eb92bde693acec9295ec97aeefa684eb8286c2b1e6b3a3
UHC 褥띕∥意덌쭏儀숈춳語ⓦ끉異욥뒽擬쒕엮濾낆±泣 1110100110110011101101101110101110100001101010111110101111110010100010001110111110100111100010001110101111110000100110011110110010101101100011111110010111011110101010001110001110000101101111001110110010110110101111111110100110001010101100111110101111110100100111001110101110111111101010111110011010100100100001011110110010100001101111101110101111101000 e9b3b6eba1abebf288efa788ebf099ecad8fe5dea8e385bcecb6bfe98ab3ebf49cebbfabe6a485eca1beebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)