To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????a??????????????? 001111110011111100111111001111110011111101100001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f613f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?暴???a?汗荊?權???暴??汗荊?究 0011111110010110010111000011111100111111001111110110000100111111100010101011111010001100011101000011111110011110110111000011111100111111001111111001011001011100001111110011111110001010101111101000110001110100001111111000101110000110 3f965c3f3f3f613f8abe8c743f9edc3f3f3f965c3f3f8abe8c743f8b86
EUC-JP ?暴???a?汗荊?權???暴??汗荊?究 0011111111001011101111010011111100111111001111110110000100111111101101001100000010110111110101010011111111011100110111100011111100111111001111111100101110111101001111110011111110110100110000001011011111010101001111111011010111100110 3fcbbd3f3f3f613fb4c0b7d53fdcde3f3f3fcbbd3f3fb4c0b7d53fb5e6
UTF-8 뤋暴첂샘렑a뤋汗荊㉶權샘렕뤋暴쫆뤋汗荊㉶究 11101011101001001000101111100110100110101011010011101100101100101000001011101100100000111001100011101011101000001001000101100001111010111010010010001011111001101011000110010111111010001000110110001010111000111000100110110110111001101010110010001010111011001000001110011000111010111010000010010101111010111010010010001011111001101001101010110100111011001010101110000110111010111010010010001011111001101011000110010111111010001000110110001010111000111000100110110110111001111010100110110110 eba48be69ab4ecb282ec8398eba09161eba48be6b197e88d8ae389b6e6ac8aec8398eba095eba48be69ab4ecab86eba48be6b197e88d8ae389b6e7a9b6
UHC 뤋暴첂샘렑a뤋汗荊㉶權샘렕뤋暴쫆뤋汗荊㉶究 1000111110111011111110001110110010101010100011111011101111111001100011101010011001100001100011111011101111111001110100101111101110101010101010001100011111001111111011011011101111111001100011101010101010001111101110111111100011101100101001100110000110001111101110111111100111010010111110111010101010101000110001111100111110111100 8fbbf8ecaa8fbbf98ea6618fbbf9d2fbaaa8c7cfedbbf98eaa8fbbf8eca6618fbbf9d2fbaaa8c7cfbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)