To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???意③?擬??甸??畏??撓???o?^ 0011111100111111001111111000100011010011100001110100001000111111100010110101101100111111001111111001100110110010001111110011111110001000110110000011111100111111100111011001101000111111001111110011111110000010100011110011111101011110 3f3f3f88d387423f8b5b3f3f99b23f3f88d83f3f9d9a3f3f3f828f3f5e
EUC-JP ???意??擬??甸??畏??撓???o?^ 00111111001111110011111110110000110101010011111100111111101101011011110000111111001111111101001010110100001111110011111110110000110110100011111100111111110110011111101000111111001111110011111110100011111011110011111101011110 3f3f3fb0d53f3fb5bc3f3fd2b43f3fb0da3f3fd9fa3f3f3fa3ef3f5e
UTF-8 樂꾦닄意③뾿擬대젫甸묌썑畏띿뵪撓뉗뼱溜o쨵^ 11101111101001101011111111101010101111101010011011101011100010111000010011100110100001001000111111100010100100011010001011101011101111101011111111100110100100111010110011101011100011001000000011101100101000001010101111100111100101001011100011101011101011001000110011101100100011011001000111100111100101011000111111101011100111011011111111101011101101011010101011100110100100101001001111101011100010011001011111101011101111001011000111101111101001111000101111101111101111011000111111101100101010001011010101011110 efa6bfeabea6eb8b84e6848fe291a2ebbebfe693aceb8c80eca0abe794b8ebac8cec8d91e7958feb9dbfebb5aae69293eb8997ebbcb1efa78befbd8feca8b55e
UHC 樂꾦닄意③뾿擬대젫甸묌썑畏띿뵪撓뉗뼱溜o쨵^ 11101000111110011000010011101001100010001000110111101011111100101010100011101001100101111000011111101011111101001011010011101011101000001010001111101111101001001001000111101001100110111000010011101000111001101000110111101100100101001010100011101000111101011000011111101100100101101011010011101010111111101010001111101111101001001000111101011110 e8f984e9888debf2a8e99787ebf4b4eba0a3efa491e99b84e8e68dec94a8e8f587ec96b4eafea3efa48f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)