To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?諷呈??諷呈????盤待?脹??諷呈? 0011111111100110100001011001001011100110001111110011111111100110100001011001001011100110001111110011111100111111001111111001010011010101100100011101001000111111100100101010111100111111001111111110011010000101100100101110011000111111 3fe68592e63f3fe68592e63f3f3f3f94d591d23f92af3f3fe68592e63f
EUC-JP ?諷呈??諷呈????盤待?脹??諷呈? 0011111111101011111001011100010011101000001111110011111111101011111001011100010011101000001111110011111100111111001111111100100011010111110000101101010000111111110001001011000100111111001111111110101111100101110001001110100000111111 3febe5c4e83f3febe5c4e83f3f3f3fc8d7c2d43fc4b13f3febe5c4e83f
UTF-8 뤋諷呈촏뤋諷呈쳪샘ㅾ렒盤待뤋脹챃뤋諷呈촜 111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010001111111010111010010010001011111010001010101110110111111001011001000110001000111011001011001110101010111011001000001110011000111000111000010110111110111010111010000010010010111001111001101110100100111001011011111010000101111010111010010010001011111010001000010010111001111011001011000110000011111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010011100 eba48be8abb7e59188ecb48feba48be8abb7e59188ecb3aaec8398e385beeba092e79ba4e5be85eba48be884b9ecb183eba48be8abb7e59188ecb49c
UHC 뤋諷呈촏뤋諷呈쳪샘ㅾ렒盤待뤋脹챃뤋諷呈촜 10001111101110111111100110100100111011111101000010101100010011101000111110111011111110011010010011101111110100001010101110001111101110111111100110100100111011101000111010100111110110101110111111010011111000101000111110111011111100111110110010101010010100111000111110111011111110011010010011101111110100001010110001010111 8fbbf9a4efd0ac4e8fbbf9a4efd0ab8fbbf9a4ee8ea7daefd3e28fbbf3ecaa538fbbf9a4efd0ac57

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)