What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?諷呈??諷呈??諷呈??諷呈???? 001111111110011010000101100100101110011000111111001111111110011010000101100100101110011000111111001111111110011010000101100100101110011000111111001111111110011010000101100100101110011000111111001111110011111100111111 3fe68592e63f3fe68592e63f3fe68592e63f3fe68592e63f3f3f3f
EUC-JP ?諷呈??諷呈??諷呈??諷呈???? 001111111110101111100101110001001110100000111111001111111110101111100101110001001110100000111111001111111110101111100101110001001110100000111111001111111110101111100101110001001110100000111111001111110011111100111111 3febe5c4e83f3febe5c4e83f3febe5c4e83f3febe5c4e83f3f3f3f
UTF-8 뤋諷呈촓뤋諷呈촑뤋諷呈촔뤋諷呈쳪샘ㅾ렒 111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010010011111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010010001111010111010010010001011111010001010101110110111111001011001000110001000111011001011010010010100111010111010010010001011111010001010101110110111111001011001000110001000111011001011001110101010111011001000001110011000111000111000010110111110111010111010000010010010 eba48be8abb7e59188ecb493eba48be8abb7e59188ecb491eba48be8abb7e59188ecb494eba48be8abb7e59188ecb3aaec8398e385beeba092
UHC 뤋諷呈촓뤋諷呈촑뤋諷呈촔뤋諷呈쳪샘ㅾ렒 1000111110111011111110011010010011101111110100001010110001010001100011111011101111111001101001001110111111010000101011000100111110001111101110111111100110100100111011111101000010101100010100101000111110111011111110011010010011101111110100001010101110001111101110111111100110100100111011101000111010100111 8fbbf9a4efd0ac518fbbf9a4efd0ac4f8fbbf9a4efd0ac528fbbf9a4efd0ab8fbbf9a4ee8ea7

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
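
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


# Example: print one table row per charset for a single hiragana character.
for label, (bits, hexstr) in convert("あ").items():
    print(label, bits, hexstr)
```

Input that was already decoded with the wrong charset will produce mojibake in the Character column, much like the rows above, so it may help to confirm the source encoding before converting.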