To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???意③?擬???ラ?畏??慂ъ????^ 0011111100111111001111111000100011010011100001110100001000111111100010110101101100111111001111110011111110000011100010010011111110001000110110000011111100111111100111001100100010000100100011000011111100111111001111110011111101011110 3f3f3f88d387423f8b5b3f3f3f83893f88d83f3f9cc8848c3f3f3f3f5e
EUC-JP ???意??擬???ラ?畏??慂ъ????^ 00111111001111110011111110110000110101010011111100111111101101011011110000111111001111110011111110100101111010010011111110110000110110100011111100111111110110001100101010100111111011000011111100111111001111110011111101011110 3f3f3fb0d53f3fb5bc3f3f3fa5e93fb0da3f3fd8caa7ec3f3f3f3f5e
UTF-8 樂꾦닄意③쉬擬⑸젇列ラ썑畏띿뵪慂ъ뼦溜딀씟^ 111011111010011010111111111010101011111010100110111010111000101110000100111001101000010010001111111000101001000110100010111011001000100110101100111001101001001110101100111000101001000110111000111011001010000010000111111011111010011010011100111000111000001110101001111011001000110110010001111001111001010110001111111010111001110110111111111010111011010110101010111001101000010110000010110100011000101011101011101111001010011011101111101001111000101111101011100101001000000011101100100101001001111101011110 efa6bfeabea6eb8b84e6848fe291a2ec89ace693ace291b8eca087efa69ce383a9ec8d91e7958feb9dbfebb5aae68582d18aebbca6efa78beb9480ec949f5e
UHC 樂꾦닄意③쉬擬⑸젇列ラ썑畏띿뵪慂ъ뼦溜딀씟^ 11101000111110011000010011101001100010001000110111101011111100101010100011101001101111011010110011101011111101001010100111101011101000001000101011100110111010101010101111101001100110111000010011101000111001101000110111101100100101001010100011101001101111011010110011101100100101101010100111101010111111101000101011100110100111011011001101011110 e8f984e9888debf2a8e9bdacebf4a9eba08ae6eaabe99b84e8e68dec94a8e9bdacec96a9eafe8ae69db35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)