To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 勇??維??懿??孃る????揄щ?壤 100101110100010100111111001111111000100011011011001111110011111110011100111100100011111100111111100110110110111110000010111010010011111100111111001111110011111110011101100010011000010010001011001111111001101011011111 97453f3f88db3f3f9cf23f3f9b6f82e93f3f3f3f9d89848b3f9adf
EUC-JP 勇??維??懿??孃る?佾??揄щ?壤 1100110110100110001111110011111110110000110111010011111100111111110110001111010000111111001111111101010111010000101001001110101100111111100011111011000011111011001111110011111111011001111010011010011111101011001111111101010011100001 cda63f3fb0dd3f3fd8f43f3fd5d0a4eb3f8fb0fb3f3fd9e9a7eb3fd4e1
UTF-8 勇싲즾維쀨굜懿얇룋孃る슡佾띸춯揄щ㎥壤 1110010110001011100001111110110010001011101100101110110010100110101111101110011110110110101011011110110010000000101010001110101010110101100111001110011010000111101111111110110010010110100001111110101110100011100010111110010110101101100000111110001110000010100010111110110010001010101000011110010010111101101111101110101110011101101110001110110010110110101011111110011010001111100001001101000110001001111000111000111010100101111001011010001110100100 e58b87ec8bb2eca6bee7b6adec80a8eab59ce687bfec9687eba38be5ad83e3828bec8aa1e4bdbeeb9db8ecb6afe68f84d189e38ea5e5a3a4
UHC 勇싲즾維쀨굜懿얇룋孃る슡佾띸춯揄щ㎥壤 1110100110111000100110101110101110100011100100001110101110101011100101111110100010000010100001001110101111110011101111101110001110001111100010101110010110111110101010101110101110011010101011011110110011101011100011011110011110101101100011001110101011110001101011001110101110100111101010011110010110111101 e9b89aeba390ebab97e88284ebf3bee38f8ae5beaaeb9aadeceb8de7ad8ceaf1aceba7a9e5bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)