Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????應??鸚?????貫毅??違?? 001111110011111100111111001111110011111100111111100111001110010000111111001111111110101001011111001111110011111100111111001111110011111110001010110100011000101101000010001111110011111110001000111000010011111100111111 3f3f3f3f3f3f9ce43f3fea5f3f3f3f3f3f8ad18b423f3f88e13f3f
EUC-JP ???沅??應??鸚??彛??貫毅??違?? 00111111001111110011111110001111110001101110100100111111001111111101100011100110001111110011111111110011110000000011111100111111100011111011110011111010001111110011111110110100110100111011010110100011001111110011111110110000111000110011111100111111 3f3f3f8fc6e93f3fd8e63f3ff3c03f3f8fbcfa3f3fb4d3b5a33f3fb0e33f3f
UTF-8 嶺뚮뿭沅뤺굢應몃뼡鸚룐댙彛볞튃貫毅숁삌違겷낦 111011111010011010101011111010111001101010101110111010111011111110101101111001101011001010000101111010111010010010111010111010101011010110100010111001101000011110001001111010111010101010000011111010111011110010100001111010011011100010011010111010111010001110010000111010111000110010011001111001011011110110011011111010111011001110011110111011011000101010000011111010001011001010101011111001101010111110000101111011001000100010000001111011001000001010001100111010011000000110010101111010101011001010110111111010111000001010100110 efa6abeb9aaeebbfade6b285eba4baeab5a2e68789ebaa83ebbca1e9b89aeba390eb8c99e5bd9bebb39eed8a83e8b2abe6af85ec8881ec828ce98195eab2b7eb82a6
UHC 嶺뚮뿭沅뤺굢應몃뼡鸚룐댙彛볞튃貫毅숁삌違겷낦 1110011110101101100011001110101110010111101011011110101010110110100011111110100010000010100010011110101111101011101110001110101110010110101001001110010110100100101101111110001010001000101111011110110010101101100100111110010010111001100110011100111010111011111010111111011010011001111001101001100010010011111010101101111010000001110000111000011001000010 e7ad8ceb97adeab68fe88289ebebb8eb96a4e5a4b7e288bdecad93e4b999cebbebf699e69893eade81c38642

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
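
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may differ from the page's converter for some vendor-specific characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For example, the hiragana "あ" encodes to e38182 in UTF-8, 82a0 in CP932, and a4a2 in EUC-JP, while ISO-8859-1 cannot represent it and yields 3f.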