What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遲鯉スろォ∫ュ鷺槇遲鯉スろォ∫ュ作権B 11100111101011011000110011101111101111011000001011101011101010111000000111100111101011011000110111101011111010101010000011100111101011011000110011101111101111011000001011101011101010111000000111100111101011011000110111101100100011001010000001000010 e7ad8cefbd82ebab81e7ad8debeaa0e7ad8cefbd82ebab81e7ad8dec8ca042
EUC-JP 遲鯉スろォ∫ュ鷺槇遲鯉スろォ∫ュ作権B 11101110101011111011100011110001100011101011110110100100111011011000111010101011101000101110100110001110101011011011101011101101111101001010001011101110101011111011100011110001100011101011110110100100111011011000111010101011101000101110100110001110101011011011101011101110101110001010001001000010 eeafb8f18ebda4ed8eaba2e98eadbaedf4a2eeafb8f18ebda4ed8eaba2e98eadbaeeb8a242
UTF-8 遲鯉スろォ∫ュ鷺槇遲鯉スろォ∫ュ作権B 11101001100000011011001011101001101011111000100111101111101111011011110111100011100000101000110111101111101111011010101111100010100010001010101111101111101111011010110111101001101101111011101011100110101001111000011111101001100000011011001011101001101011111000100111101111101111011011110111100011100000101000110111101111101111011010101111100010100010001010101111101111101111011010110111100100101111011001110011100110101010001010100101000010 e981b2e9af89efbdbde3828defbdabe288abefbdade9b7bae6a787e981b2e9af89efbdbde3828defbdabe288abefbdade4bd9ce6a8a942
UHC 遲鯉?ろ?∫?鷺?遲鯉?ろ?∫?作?B 1111001011000000110101111110111100111111101010101110110100111111101000011111001000111111110101101101110000111111111100101100000011010111111011110011111110101010111011010011111110100001111100100011111111101101110000100011111101000010 f2c0d7ef3faaed3fa1f23fd6dc3ff2c0d7ef3faaed3fa1f23fedc23f42

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
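
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation: the function name `bit_strings` and the `CHARSETS` mapping are made up for illustration, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows but is likewise assumed rather than confirmed.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # charset cannot represent, instead of raising an exception.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings("権B", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits so that the binary column lines up with the two-digit hexadecimal form, as in the table above.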