To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???孺??儀??節??語⊥??嚴р?愉 00111111001111110011111110011011011111010011111100111111100010110101011000111111001111111001000011011111001111110011111110001100111010101000000111011011001111110011111110011010100011101000010010000010001111111001011011111001 3f3f3f9b7d3f3f8b563f3f90df3f3f8cea81db3f3f9a8e84823f96f9
EUC-JP ???孺??儀??節??語⊥??嚴р?愉 00111111001111110011111111010101110111100011111100111111101101011011011100111111001111111100000011100001001111110011111110111000111011001010001011011101001111110011111111010011111011101010011111100010001111111100110011111011 3f3f3fd5de3f3fb5b73f3fc0e13f3fb8eca2dd3f3fd3eea7e23fccfb
UTF-8 嶺뚮벊孺욜춱儀양춯節뚮츋語⊥띲럶嚴р뫗愉 1110111110100110101010111110101110011010101011101110101110110010100010101110010110101101101110101110110010011010100111001110110010110110101100011110010110000100100000001110110010010110100100011110110010110110101011111110011110101111100000001110101110011010101011101110110010111000100010111110100010101010100111101110001010001010101001011110101110011101101100101110101110011111101101101110010110011010101101001101000110000000111010111010101110010111111001101000010010001001 efa6abeb9aaeebb28ae5adbaec9a9cecb6b1e58480ec9691ecb6afe7af80eb9aaeecb88be8aa9ee28aa5eb9db2eb9fb6e59ab4d180ebab97e68489
UHC 嶺뚮벊孺욜춱儀양춯節뚮츋語⊥띲럶嚴р뫗愉 11100111101011011000110011101011100100111010110111101010111010001011111111100111101011011000110111101011111100001011111011100111101011011000110011101111101111011000110011101011101011101000011111100101110111101010000111010001100011011110001110001110100101011110010111110001101011001110001010010001101110011110101011110000 e7ad8ceb93adeae8bfe7ad8debf0bee7ad8cefbd8cebae87e5dea1d18de38e95e5f1ace291b9eaf0

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and UNIX (EUC-JP).
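
The table above can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: the helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above equates SJIS-Win with CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Python's codec for Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper. Unencodable characters become "?" (0x3f),
    an assumption that mirrors the output shown in the table.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one table-like row per charset for a single character.
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_row("語", codec)
    print(label, "語", binary, hexadecimal)
```

For the character 語, this yields e8aa9e under UTF-8, 8cea under SJIS-WIN, and b8ec under EUC-JP, which agrees with the byte sequences for that character in the rows above.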