Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character(s) Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 竺捨偲爾偲耳竺捨W}竺捨偲爾偲耳竺捨W{^ 10001110101100011000111011001100100011101100001110001110101000101000111011000011100011101010100010001110101100011000111011001100010101110111110110001110101100011000111011001100100011101100001110001110101000101000111011000011100011101010100010001110101100011000111011001100010101110111101101011110 8eb18ecc8ec38ea28ec38ea88eb18ecc577d8eb18ecc8ec38ea28ec38ea88eb18ecc577b5e
EUC-JP 竺捨偲爾偲耳竺捨W}竺捨偲爾偲耳竺捨W{^ 10111100101100111011110011001110101111001100010110111100101001001011110011000101101111001010101010111100101100111011110011001110010101110111110110111100101100111011110011001110101111001100010110111100101001001011110011000101101111001010101010111100101100111011110011001110010101110111101101011110 bcb3bccebcc5bca4bcc5bcaabcb3bcce577dbcb3bccebcc5bca4bcc5bcaabcb3bcce577b5e
UTF-8 竺捨偲爾偲耳竺捨W}竺捨偲爾偲耳竺捨W{^ 1110011110101011101110101110011010001101101010001110010110000001101100101110011110001000101111101110010110000001101100101110100010000000101100111110011110101011101110101110011010001101101010000101011101111101111001111010101110111010111001101000110110101000111001011000000110110010111001111000100010111110111001011000000110110010111010001000000010110011111001111010101110111010111001101000110110101000010101110111101101011110 e7abbae68da8e581b2e788bee581b2e880b3e7abbae68da8577de7abbae68da8e581b2e788bee581b2e880b3e7abbae68da8577b5e
UHC 竺捨?爾?耳竺捨W}竺捨?爾?耳竺捨W{^ 111101011110011111011110110101110011111111101100101100110011111111101100101111001111010111100111110111101101011101010111011111011111010111100111110111101101011100111111111011001011001100111111111011001011110011110101111001111101111011010111010101110111101101011110 f5e7ded73fecb33fecbcf5e7ded7577df5e7ded73fecb33fecbcf5e7ded7577b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
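
The rows of the table can be approximated offline with a minimal Python sketch. It is not the converter's own code: the codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) are assumed to be Python's closest equivalents of the charsets listed, and `encode_row` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row("W}", codec)
        print(label, bits, hex_string)
```

Because "W" and "}" are ASCII, every charset in the mapping encodes "W}" to the same two bytes, 57 and 7d; the differences between rows only appear for non-ASCII input.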