To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汚????絶???т?節????絶???ф? 10001001100110000011111100111111001111110011111110010000111000100011111100111111001111111000010010000100001111111001000011011111001111110011111100111111001111111001000011100010001111110011111100111111100001001000011000111111 89983f3f3f3f90e23f3f3f84843f90df3f3f3f3f90e23f3f3f84863f
EUC-JP 汚????絶???т?節????絶???ф? 10110001111110000011111100111111001111110011111111000000111001000011111100111111001111111010011111100100001111111100000011100001001111110011111100111111001111111100000011100100001111110011111100111111101001111110011000111111 b1f83f3f3f3fc0e43f3f3fa7e43fc0e13f3f3f3fc0e43f3f3fa7e63f
UTF-8 汚얌퍢說튚絶먲쉼料т툧節룝퍢說튚絶먲쉼料ф닅 11100110101100011001101011101100100101101000110011101101100011011010001011101111101001101010000111101101100010101001101011100111101101011011011011101011101010001011001011101100100010011011110011101111101001101011111011010001100000101110110110001000101001111110011110101111100000001110101110100011100111011110110110001101101000101110111110100110101000011110110110001010100110101110011110110101101101101110101110101000101100101110110010001001101111001110111110100110101111101101000110000100111010111000101110000101 e6b19aec968ced8da2efa6a1ed8a9ae7b5b6eba8b2ec89bcefa6bed182ed88a7e7af80eba39ded8da2efa6a1ed8a9ae7b5b6eba8b2ec89bcefa6bed184eb8b85
UHC 汚얌퍢說튚絶먲쉼料т툧節룝퍢說튚絶먲쉼料ф닅 1110011111111101101111101110010010111011100110011110011011110010101110100100101111101111101111101001000011101111101111011011000011101000111101111010110011100100101110001001111011101111101111011011011111100100101110111001100111100110111100101011101001001011111011111011111010010000111011111011110110110000111010001111011110101100111001101000100010001110 e7fdbee4bb99e6f2ba4befbe90efbdb0e8f7ace4b89eefbdb7e4bb99e6f2ba4befbe90efbdb0e8f7ace6888e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)