To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN 偲自篠コト疾偲識篠漆篠嫉偲而篠痔篠室gB 100011101100001110001110101010011000111011000010101110101100010010001110101111101000111011000011100011101010111110001110110000101000111010111101100011101100001010001110101110011000111011000011100011101010011110001110110000101000111010100100100011101100001010001110101110100110011101000010 8ec38ea98ec2bac48ebe8ec38eaf8ec28ebd8ec28eb98ec38ea78ec28ea48ec28eba6742
EUC-JP 偲自篠コト疾偲識篠漆篠嫉偲而篠痔篠室gB 1011110011000101101111001010101110111100110001001000111010111010100011101100010010111100110000001011110011000101101111001011000110111100110001001011110010111111101111001100010010111100101110111011110011000101101111001010100110111100110001001011110010100110101111001100010010111100101111000110011101000010 bcc5bcabbcc48eba8ec4bcc0bcc5bcb1bcc4bcbfbcc4bcbbbcc5bca9bcc4bca6bcc4bcbc6742
UTF-8 偲自篠コト疾偲識篠漆篠嫉偲而篠痔篠室gB 1110010110000001101100101110100010000111101010101110011110101111101000001110111110111101101110101110111110111110100001001110011110010110101111101110010110000001101100101110100010101101100110001110011110101111101000001110011010111100100001101110011110101111101000001110010110101011100010011110010110000001101100101110100010000000100011001110011110101111101000001110011110010111100101001110011110101111101000001110010110101110101001000110011101000010 e581b2e887aae7afa0efbdbaefbe84e796bee581b2e8ad98e7afa0e6bc86e7afa0e5ab89e581b2e8808ce7afa0e79794e7afa0e5aea46742
UHC ?自篠??疾?識篠漆篠嫉?而篠痔篠室gB 001111111110110110111011111000011100011000111111001111111111001011110000001111111110001111011011111000011100011011110110110101001110000111000110111100101110110000111111111011001011101111100001110001101111011011000000111000011100011011100011111110000110011101000010 3fedbbe1c63f3ff2f03fe3dbe1c6f6d4e1c6f2ec3fecbbe1c6f6c0e1c6e3f86742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)