Which bit string does each character set encode a character (or characters) to?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????W}???????????W{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 貞??敎ρ汁?????W}貞??敎ρ汁?????W{^ 1001001011100101001111110011111111111010110011011000001111001111100011110110000000111111001111110011111100111111001111110101011101111101100100101110010100111111001111111111101011001101100000111100111110001111011000000011111100111111001111110011111100111111010101110111101101011110 92e53f3ffacd83cf8f603f3f3f3f3f577d92e53f3ffacd83cf8f603f3f3f3f3f577b5e
EUC-JP 貞???ρ汁?????W}貞???ρ汁?????W{^ 110001001110011100111111001111110011111110100110110100011011110111000001001111110011111100111111001111110011111101010111011111011100010011100111001111110011111100111111101001101101000110111101110000010011111100111111001111110011111100111111010101110111101101011110 c4e73f3f3fa6d1bdc13f3f3f3f3f577dc4e73f3f3fa6d1bdc13f3f3f3f3f577b5e
UTF-8 貞쭸렫敎ρ汁흗렩쾨렯렞W}貞쭸렫敎ρ汁흗렩쾨렯렞W{^ 111010001011001010011110111011001010110110111000111010111010000010101011111001101001010110001110110011111000000111100110101100011000000111101101100111011001011111101011101000001010100111101100101111101010100011101011101000001010111111101011101000001001111001010111011111011110100010110010100111101110110010101101101110001110101110100000101010111110011010010101100011101100111110000001111001101011000110000001111011011001110110010111111010111010000010101001111011001011111010101000111010111010000010101111111010111010000010011110010101110111101101011110 e8b29eecadb8eba0abe6958ecf81e6b181ed9d97eba0a9ecbea8eba0afeba09e577de8b29eecadb8eba0abe6958ecf81e6b181ed9d97eba0a9ecbea8eba0afeba09e577b5e
UHC 貞쭸렫敎ρ汁흗렩쾨렯렞W}貞쭸렫敎ρ汁흗렩쾨렯렞W{^ 11101111111101101100001011100110100011101011100111001110111001111010010111110001111100011111000011001000111010011000111010110111110001001110101010001110101111001000111010101111010101110111110111101111111101101100001011100110100011101011100111001110111001111010010111110001111100011111000011001000111010011000111010110111110001001110101010001110101111001000111010101111010101110111101101011110 eff6c2e68eb9cee7a5f1f1f0c8e98eb7c4ea8ebc8eaf577deff6c2e68eb9cee7a5f1f1f0c8e98eb7c4ea8ebc8eaf577b5e

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
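
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the `CODECS` mapping from the table's charset labels to Python codec names and the helper `encode_in_charset` are assumptions made for illustration, and Python's codecs may not match this page's converter for every character. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page notes SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec stands in for UHC
}


def encode_in_charset(text, label):
    """Return (binary string, hex string) for text encoded in the given charset.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(CODECS[label], errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label in CODECS:
        bits, hex_string = encode_in_charset("ρ", label)
        print(label, bits, hex_string)
```

For the single character "ρ", this should give cf81 in UTF-8 and 3f (a replacement "?") in ISO-8859-1, consistent with the corresponding bytes in the table above.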