To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟??誼③??l?輿??誼??巡??暗 100011001110010100111111001111111000101101100010100001110100001000111111001111111000001010001100001111111001011101100000001111110011111110001011011000100011111100111111100011111000010000111111001111111000100011000011 8ce53f3f8b6287423f3f828c3f97603f3f8b623f3f8f843f3f88c3
EUC-JP 悟??誼??洹l?輿??誼??巡??暗 10111000111001110011111100111111101101011100001100111111001111111000111111000111101110101010001111101100001111111100110111000001001111110011111110110101110000110011111100111111101111011110010000111111001111111011000011000101 b8e73f3fb5c33f3f8fc7baa3ec3fcdc13f3fb5c33f3fbde43f3fb0c5
UTF-8 悟딅봾誼③펶洹l졁輿살눖誼꿩퐗巡볦젟暗 111001101000001010011111111010111001010010000101111010111011010010111110111010001010101010111100111000101001000110100010111011011000111010110110111001101011010010111001111011111011110110001100111011001010000110000001111010001011110010111111111011001000001010110100111010111000100010010110111010001010101010111100111010101011111110101001111011011001000010010111111001011011011110100001111010111011001110100110111011001010000010011111111001101001101010010111 e6829feb9485ebb4bee8aabce291a2ed8eb6e6b4b9efbd8ceca181e8bcbfec82b4eb8896e8aabceabfa9ed9097e5b7a1ebb3a6eca09fe69a97
UHC 悟딅봾誼③펶洹l졁輿살눖誼꿩퐗巡볦젟暗 1110011111110110100010101110101110010100100001011110101111111110101010001110100110111100100001111110101010110111101000111110110010100000101100101110011010101011101110111110110010000111101100001110101111111110101100101110011010111101100000011110001011011110100100111110110010100000100110011110010011011110 e7f68aeb9485ebfea8e9bc87eab7a3eca0b2e6abbbec87b0ebfeb2e6bd81e2de93eca099e4de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)