To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????}v????????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN セュ爾セ、鉦ヒ辞爾セャ宍}vセュ爾セ、鉦ヒ辞爾セャ宍}vB 101111101010110110001110101000101011111010100100100011111101111011001011100011101010101110001110101000101011111010101100100011101011001101111101011101101011111010101101100011101010001010111110101001001000111111011110110010111000111010101011100011101010001010111110101011001000111010110011011111010111011001000010 bead8ea2bea48fdecb8eab8ea2beac8eb37d76bead8ea2bea48fdecb8eab8ea2beac8eb37d7642
EUC-JP セュ爾セ、鉦ヒ辞爾セャ宍}vセュ爾セ、鉦ヒ辞爾セャ宍}vB 1000111010111110100011101010110110111100101001001000111010111110100011101010010010111110111000001000111011001011101111001010110110111100101001001000111010111110100011101010110010111100101101010111110101110110100011101011111010001110101011011011110010100100100011101011111010001110101001001011111011100000100011101100101110111100101011011011110010100100100011101011111010001110101011001011110010110101011111010111011001000010 8ebe8eadbca48ebe8ea4bee08ecbbcadbca48ebe8eacbcb57d768ebe8eadbca48ebe8ea4bee08ecbbcadbca48ebe8eacbcb57d7642
UTF-8 セュ爾セ、鉦ヒ辞爾セャ宍}vセュ爾セ、鉦ヒ辞爾セャ宍}vB 1110111110111101101111101110111110111101101011011110011110001000101111101110111110111101101111101110111110111101101001001110100110001001101001101110111110111110100010111110100010111110100111101110011110001000101111101110111110111101101111101110111110111101101011001110010110101110100011010111110101110110111011111011110110111110111011111011110110101101111001111000100010111110111011111011110110111110111011111011110110100100111010011000100110100110111011111011111010001011111010001011111010011110111001111000100010111110111011111011110110111110111011111011110110101100111001011010111010001101011111010111011001000010 efbdbeefbdade788beefbdbeefbda4e989a6efbe8be8be9ee788beefbdbeefbdace5ae8d7d76efbdbeefbdade788beefbdbeefbda4e989a6efbe8be8be9ee788beefbdbeefbdace5ae8d7d7642
UHC ??爾??鉦??爾???}v??爾??鉦??爾???}vB 0011111100111111111011001011001100111111001111111110111111111010001111110011111111101100101100110011111100111111001111110111110101110110001111110011111111101100101100110011111100111111111011111111101000111111001111111110110010110011001111110011111100111111011111010111011001000010 3f3fecb33f3feffa3f3fecb33f3f3f7d763f3fecb33f3feffa3f3fecb33f3f3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)