To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????WB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5742
SJIS-WIN 偲ヲト自偲ヲト雫偲、ト疾偲ヲト而偲ヲトーナ、ト室WB 100011101100001110100110110001001000111010101001100011101100001110100110110001001000111010110100100011101100001110100100110001001000111010111110100011101100001110100110110001001000111010100111100011101100001110100110110001001011000011000101101001001100010010001110101110100101011101000010 8ec3a6c48ea98ec3a6c48eb48ec3a4c48ebe8ec3a6c48ea78ec3a6c4b0c5a4c48eba5742
EUC-JP 偲ヲト自偲ヲト雫偲、ト疾偲ヲト而偲ヲトーナ、ト室WB 1011110011000101100011101010011010001110110001001011110010101011101111001100010110001110101001101000111011000100101111001011011010111100110001011000111010100100100011101100010010111100110000001011110011000101100011101010011010001110110001001011110010101001101111001100010110001110101001101000111011000100100011101011000010001110110001011000111010100100100011101100010010111100101111000101011101000010 bcc58ea68ec4bcabbcc58ea68ec4bcb6bcc58ea48ec4bcc0bcc58ea68ec4bca9bcc58ea68ec48eb08ec58ea48ec4bcbc5742
UTF-8 偲ヲト自偲ヲト雫偲、ト疾偲ヲト而偲ヲトーナ、ト室WB 1110010110000001101100101110111110111101101001101110111110111110100001001110100010000111101010101110010110000001101100101110111110111101101001101110111110111110100001001110100110011011101010111110010110000001101100101110111110111101101001001110111110111110100001001110011110010110101111101110010110000001101100101110111110111101101001101110111110111110100001001110100010000000100011001110010110000001101100101110111110111101101001101110111110111110100001001110111110111101101100001110111110111110100001011110111110111101101001001110111110111110100001001110010110101110101001000101011101000010 e581b2efbda6efbe84e887aae581b2efbda6efbe84e99babe581b2efbda4efbe84e796bee581b2efbda6efbe84e8808ce581b2efbda6efbe84efbdb0efbe85efbda4efbe84e5aea45742
UHC ???自???????疾???而???????室WB 001111110011111100111111111011011011101100111111001111110011111100111111001111110011111100111111111100101111000000111111001111110011111111101100101110110011111100111111001111110011111100111111001111110011111111100011111110000101011101000010 3f3f3fedbb3f3f3f3f3f3f3ff2f03f3f3fecbb3f3f3f3f3f3f3fe3f85742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)