To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN ???誼↑?儀??W}???誼↑?儀??W{^ 0011111100111111001111111000101101100010100000011010101000111111100010110101011000111111001111110101011101111101001111110011111100111111100010110110001010000001101010100011111110001011010101100011111100111111010101110111101101011110 3f3f3f8b6281aa3f8b563f3f577d3f3f3f8b6281aa3f8b563f3f577b5e
EUC-JP ???誼↑?儀??W}???誼↑?儀??W{^ 0011111100111111001111111011010111000011101000101010110000111111101101011011011100111111001111110101011101111101001111110011111100111111101101011100001110100010101011000011111110110101101101110011111100111111010101110111101101011110 3f3f3fb5c3a2ac3fb5b73f3f577d3f3f3fb5c3a2ac3fb5b73f3f577b5e
UTF-8 吳됰쓹誼↑퍨儀뺤젌W}吳됰쓹誼↑퍨儀뺤젌W{^ 1110010110010000101100111110101110010000101100001110110010010011101110011110100010101010101111001110001010000110100100011110110110001101101010001110010110000100100000001110101110111010101001001110110010100000100011000101011101111101111001011001000010110011111010111001000010110000111011001001001110111001111010001010101010111100111000101000011010010001111011011000110110101000111001011000010010000000111010111011101010100100111011001010000010001100010101110111101101011110 e590b3eb90b0ec93b9e8aabce28691ed8da8e58480ebbaa4eca08c577de590b3eb90b0ec93b9e8aabce28691ed8da8e58480ebbaa4eca08c577b5e
UHC 吳됰쓹誼↑퍨儀뺤젌W}吳됰쓹誼↑퍨儀뺤젌W{^ 1110011111101111100010011110101110011101100101011110101111111110101000011110100010111011100111111110101111110000100101011110110010100000100011010101011101111101111001111110111110001001111010111001110110010101111010111111111010100001111010001011101110011111111010111111000010010101111011001010000010001101010101110111101101011110 e7ef89eb9d95ebfea1e8bb9febf095eca08d577de7ef89eb9d95ebfea1e8bb9febf095eca08d577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)