To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?諷呈??諷呈????紐待?脹????腔 00111111111001101000010110010010111001100011111100111111111001101000010110010010111001100011111100111111001111110011111110010101010100101001000111010010001111111001001010101111001111110011111100111111001111111000110101101111 3fe68592e63f3fe68592e63f3f3f3f955291d23f92af3f3f3f3f8d6f
EUC-JP ?諷呈??諷呈????紐待?脹????腔 00111111111010111110010111000100111010000011111100111111111010111110010111000100111010000011111100111111001111110011111111001001101100111100001011010100001111111100010010110001001111110011111100111111001111111011100111010000 3febe5c4e83f3febe5c4e83f3f3f3fc9b3c2d43fc4b13f3f3f3fb9d0
UTF-8 뤋諷呈촊뤋諷呈쳪샘ㅾ렒紐待뤋脹찋샘ㅾ렒腔 111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010001010111010111010010010001011111010001010101110110111111001011001000110001000111011001011001110101010111011001000001110011000111000111000010110111110111010111010000010010010111001111011010010010000111001011011111010000101111010111010010010001011111010001000010010111001111011001011000010001011111011001000001110011000111000111000010110111110111010111010000010010010111010001000010110010100 eba48be8abb7e59188ecb48aeba48be8abb7e59188ecb3aaec8398e385beeba092e7b490e5be85eba48be884b9ecb08bec8398e385beeba092e88594
UHC 뤋諷呈촊뤋諷呈쳪샘ㅾ렒紐待뤋脹찋샘ㅾ렒腔 10001111101110111111100110100100111011111101000010101100010010101000111110111011111110011010010011101111110100001010101110001111101110111111100110100100111011101000111010100111110100101110111111010011111000101000111110111011111100111110110010101001100011111011101111111001101001001110111010001110101001111100101110110111 8fbbf9a4efd0ac4a8fbbf9a4efd0ab8fbbf9a4ee8ea7d2efd3e28fbbf3eca98fbbf9a4ee8ea7cbb7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)