Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????AB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4142
SJIS-WIN 湿芝湿湿湿芝湿漆湿柴湿痔湿芝湿而湿柴シ狆ニ湿嫉AB 10001110101111001000111011000101100011101011110010001110101111001000111010111100100011101100010110001110101111001000111010111101100011101011110010001110110001001000111010111100100011101010010010001110101111001000111011000101100011101011110010001110101001111000111010111100100011101100010010111100111000001011110011000110100011101011110010001110101110010100000101000010 8ebc8ec58ebc8ebc8ebc8ec58ebc8ebd8ebc8ec48ebc8ea48ebc8ec58ebc8ea78ebc8ec4bce0bcc68ebc8eb94142
EUC-JP 湿芝湿湿湿芝湿漆湿柴湿痔湿芝湿而湿柴シ狆ニ湿嫉AB 101111001011111010111100110001111011110010111110101111001011111010111100101111101011110011000111101111001011111010111100101111111011110010111110101111001100011010111100101111101011110010100110101111001011111010111100110001111011110010111110101111001010100110111100101111101011110011000110100011101011110011100000101111101000111011000110101111001011111010111100101110110100000101000010 bcbebcc7bcbebcbebcbebcc7bcbebcbfbcbebcc6bcbebca6bcbebcc7bcbebca9bcbebcc68ebce0be8ec6bcbebcbb4142
UTF-8 湿芝湿湿湿芝湿漆湿柴湿痔湿芝湿而湿柴シ狆ニ湿嫉AB 1110011010111001101111111110100010001010100111011110011010111001101111111110011010111001101111111110011010111001101111111110100010001010100111011110011010111001101111111110011010111100100001101110011010111001101111111110011010011111101101001110011010111001101111111110011110010111100101001110011010111001101111111110100010001010100111011110011010111001101111111110100010000000100011001110011010111001101111111110011010011111101101001110111110111101101111001110011110001011100001101110111110111110100001101110011010111001101111111110010110101011100010010100000101000010 e6b9bfe88a9de6b9bfe6b9bfe6b9bfe88a9de6b9bfe6bc86e6b9bfe69fb4e6b9bfe79794e6b9bfe88a9de6b9bfe8808ce6b9bfe69fb4efbdbce78b86efbe86e6b9bfe5ab894142
UHC ?芝???芝?漆?柴?痔?芝?而?柴????嫉AB 00111111111100101011100100111111001111110011111111110010101110010011111111110110110101000011111111100011110000110011111111110110110000000011111111110010101110010011111111101100101110110011111111100011110000110011111100111111001111110011111111110010111011000100000101000010 3ff2b93f3f3ff2b93ff6d43fe3c33ff6c03ff2b93fecbb3fe3c33f3f3f3ff2ec4142

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
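
The same kind of table can be reproduced offline. The following is a minimal sketch, not the converter's actual implementation: the names CHARSETS and encode_table are hypothetical, and it assumes that Python's standard codecs cp932 and cp949 correspond to SJIS-Win and UHC respectively. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the ISO-8859-1 and UHC rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("AB"):
        print(label, bits, hexa)
```

Since "A" and "B" are plain ASCII, every charset listed here encodes them as the same two bytes, which is why each row in the table above ends in 4142.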