To which bit string is a character (or short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????^h????^fN}????^h????^fN{^ 0011111100111111001111110011111101011110011010000011111100111111001111110011111101011110011001100100111001111101001111110011111100111111001111110101111001101000001111110011111100111111001111110101111001100110010011100111101101011110 3f3f3f3f5e683f3f3f3f5e664e7d3f3f3f3f5e683f3f3f3f5e664e7b5e
SJIS-WIN 娼ヘ愰フ^h娼ヘ愰フ^fN}娼ヘ愰フ^h娼ヘ愰フ^fN{^ 10001111101010011100110111111010110001011100110001011110011010001000111110101001110011011111101011000101110011000101111001100110010011100111110110001111101010011100110111111010110001011100110001011110011010001000111110101001110011011111101011000101110011000101111001100110010011100111101101011110 8fa9cdfac5cc5e688fa9cdfac5cc5e664e7d8fa9cdfac5cc5e688fa9cdfac5cc5e664e7b5e
EUC-JP 娼ヘ愰フ^h娼ヘ愰フ^fN}娼ヘ愰フ^h娼ヘ愰フ^fN{^ 10111110101010111000111011001101100011111011111011001001100011101100110001011110011010001011111010101011100011101100110110001111101111101100100110001110110011000101111001100110010011100111110110111110101010111000111011001101100011111011111011001001100011101100110001011110011010001011111010101011100011101100110110001111101111101100100110001110110011000101111001100110010011100111101101011110 beab8ecd8fbec98ecc5e68beab8ecd8fbec98ecc5e664e7dbeab8ecd8fbec98ecc5e68beab8ecd8fbec98ecc5e664e7b5e
UTF-8 娼ヘ愰フ^h娼ヘ愰フ^fN}娼ヘ愰フ^h娼ヘ愰フ^fN{^ 11100101101010001011110011101111101111101000110111100110100001001011000011101111101111101000110001011110011010001110010110101000101111001110111110111110100011011110011010000100101100001110111110111110100011000101111001100110010011100111110111100101101010001011110011101111101111101000110111100110100001001011000011101111101111101000110001011110011010001110010110101000101111001110111110111110100011011110011010000100101100001110111110111110100011000101111001100110010011100111101101011110 e5a8bcefbe8de684b0efbe8c5e68e5a8bcefbe8de684b0efbe8c5e664e7de5a8bcefbe8de684b0efbe8c5e68e5a8bcefbe8de684b0efbe8c5e664e7b5e
UHC 娼?愰?^h娼?愰?^fN}娼?愰?^h娼?愰?^fN{^ 11110011110111100011111111111100110010100011111101011110011010001111001111011110001111111111110011001010001111110101111001100110010011100111110111110011110111100011111111111100110010100011111101011110011010001111001111011110001111111111110011001010001111110101111001100110010011100111101101011110 f3de3ffcca3f5e68f3de3ffcca3f5e664e7df3de3ffcca3f5e68f3de3ffcca3f5e664e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
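
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 and UHC rows suggest. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hex_string) in convert("^h").items():
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column. Python's codecs may differ from this page's converter for rarely used characters.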