To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????韋??霓??竊??儒??甕?? 111000011001111100111111001111110011111100111111001111111110100011101000001111110011111111101000101111010011111100111111111000101000011000111111001111111000111011110010001111110011111111100001010100000011111100111111 e19f3f3f3f3f3fe8e83f3fe8bd3f3fe2863f3f8ef23f3fe1503f3f
EUC-JP 癲?????韋??霓??竊??儒??甕?? 111000101010000100111111001111110011111100111111001111111111000011101010001111110011111111110000101111110011111100111111111000111110011000111111001111111011110011110100001111110011111111100001101100010011111100111111 e2a13f3f3f3f3ff0ea3f3ff0bf3f3fe3e63f3fbcf43f3fe1b13f3f
UTF-8 癲띿슜杻⑵린韋용렢霓낅뿫竊먬돍儒룹컜甕곌퓖 111001111001100110110010111010111001110110111111111011001000101010011100111011111010011110001000111000101001000110110101111010111010011010110000111010011001111110001011111011001001101010101001111010111010000010100010111010011001110010010011111010111000001010000101111010111011111110101011111001111010101110001010111010111010100010101100111010111000111110001101111001011000010010010010111010111010001110111001111011001011101110011100111001111001010010010101111010101011001110001100111011011001001110010110 e799b2eb9dbfec8a9cefa788e291b5eba6b0e99f8bec9aa9eba0a2e99c93eb8285ebbfabe7ab8aeba8aceb8f8de58492eba3b9ecbb9ce79495eab38ced9396
UHC 癲띿슜杻⑵린韋용렢霓낅뿫竊먬돍儒룹컜甕곌퓖 111011111010011010001101111011001001101010101001111010101111010010101001111010001011100010110000111010101101111110111111111010111000111010110011111001111110011110000101111010111001011110101011111011111011110010010000111010011000100110011011111010101110001110110111111011001011000010000111111010001011100010110000111010101011111110000001 efa68dec9aa9eaf4a9e8b8b0eadfbfeb8eb3e7e785eb97abefbc90e9899beae3b7ecb087e8b8b0eabf81

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)