To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 艱城8豎援境覲レ咳W}艱城8豎援境覲レ咳W{^ 1110010010000101100011111110100110000010010101111110011010110001100010011000011110001011101010111110011001010010100000111000110010001010010100000101011101111101111001001000010110001111111010011000001001010111111001101011000110001001100001111000101110101011111001100101001010000011100011001000101001010000010101110111101101011110 e4858fe98257e6b189878babe652838c8a50577de4858fe98257e6b189878babe652838c8a50577b5e
EUC-JP 艱城8豎援境覲レ咳W}艱城8豎援境覲レ咳W{^ 1110011111100101101111101110101110100011101110001110110010110011101100011110011110110110101011011110101110110011101001011110110010110011101100010101011101111101111001111110010110111110111010111010001110111000111011001011001110110001111001111011011010101101111010111011001110100101111011001011001110110001010101110111101101011110 e7e5beeba3b8ecb3b1e7b6adebb3a5ecb3b1577de7e5beeba3b8ecb3b1e7b6adebb3a5ecb3b1577b5e
UTF-8 艱城8豎援境覲レ咳W}艱城8豎援境覲レ咳W{^ 1110100010001001101100011110010110011111100011101110111110111100100110001110100010110001100011101110011010001111101101001110010110100010100000111110100010100110101100101110001110000011101011001110010110010010101100110101011101111101111010001000100110110001111001011001111110001110111011111011110010011000111010001011000110001110111001101000111110110100111001011010001010000011111010001010011010110010111000111000001110101100111001011001001010110011010101110111101101011110 e889b1e59f8eefbc98e8b18ee68fb4e5a283e8a6b2e383ace592b3577de889b1e59f8eefbc98e8b18ee68fb4e5a283e8a6b2e383ace592b3577b5e
UHC 艱城8?援境覲レ咳W}艱城8?援境覲レ咳W{^ 110010101101111011100000111100101010001110111000001111111110101010110101110011001101000111010000110011001010101111101100111110101010011001010111011111011100101011011110111000001111001010100011101110000011111111101010101101011100110011010001110100001100110010101011111011001111101010100110010101110111101101011110 cadee0f2a3b83feab5ccd1d0ccabecfaa6577dcadee0f2a3b83feab5ccd1d0ccabecfaa6577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)