Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????}v????????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN セャ蔀セュ社セャ実セュ痔}vセャ蔀セュ社セャ実セュ痔}vB 10111110101011001000111011000001101111101010110110001110110100001011111010101100100011101100000010111110101011011000111010100100011111010111011010111110101011001000111011000001101111101010110110001110110100001011111010101100100011101100000010111110101011011000111010100100011111010111011001000010 beac8ec1bead8ed0beac8ec0bead8ea47d76beac8ec1bead8ed0beac8ec0bead8ea47d7642
EUC-JP セャ蔀セュ社セャ実セュ痔}vセャ蔀セュ社セャ実セュ痔}vB 1000111010111110100011101010110010111100110000111000111010111110100011101010110110111100110100101000111010111110100011101010110010111100110000101000111010111110100011101010110110111100101001100111110101110110100011101011111010001110101011001011110011000011100011101011111010001110101011011011110011010010100011101011111010001110101011001011110011000010100011101011111010001110101011011011110010100110011111010111011001000010 8ebe8eacbcc38ebe8eadbcd28ebe8eacbcc28ebe8eadbca67d768ebe8eacbcc38ebe8eadbcd28ebe8eacbcc28ebe8eadbca67d7642
UTF-8 セャ蔀セュ社セャ実セュ痔}vセャ蔀セュ社セャ実セュ痔}vB 1110111110111101101111101110111110111101101011001110100010010100100000001110111110111101101111101110111110111101101011011110011110100100101111101110111110111101101111101110111110111101101011001110010110101110100111111110111110111101101111101110111110111101101011011110011110010111100101000111110101110110111011111011110110111110111011111011110110101100111010001001010010000000111011111011110110111110111011111011110110101101111001111010010010111110111011111011110110111110111011111011110110101100111001011010111010011111111011111011110110111110111011111011110110101101111001111001011110010100011111010111011001000010 efbdbeefbdace89480efbdbeefbdade7a4beefbdbeefbdace5ae9fefbdbeefbdade797947d76efbdbeefbdace89480efbdbeefbdade7a4beefbdbeefbdace5ae9fefbdbeefbdade797947d7642
UHC ?????社?????痔}v?????社?????痔}vB 001111110011111100111111001111110011111111011110111001000011111100111111001111110011111100111111111101101100000001111101011101100011111100111111001111110011111100111111110111101110010000111111001111110011111100111111001111111111011011000000011111010111011001000010 3f3f3f3f3fdee43f3f3f3f3ff6c07d763f3f3f3f3fdee43f3f3f3f3ff6c07d7642
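
Rows like the ones above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

Results may differ from the tool's for characters where the two implementations disagree on vendor-specific mappings.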

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).