What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ????坎???瞿}????坎???瞿{^ 00111111001111110011111100111111100110101010101000111111001111110011111111100001110110000111110100111111001111110011111100111111100110101010101000111111001111110011111111100001110110000111101101011110 3f3f3f3f9aaa3f3f3fe1d87d3f3f3f3f9aaa3f3f3fe1d87b5e
EUC-JP ????坎???瞿}????坎???瞿{^ 00111111001111110011111100111111110101001010110000111111001111110011111111100010110110100111110100111111001111110011111100111111110101001010110000111111001111110011111111100010110110100111101101011110 3f3f3f3fd4ac3f3f3fe2da7d3f3f3f3fd4ac3f3f3fe2da7b5e
UTF-8 쒔렜쑈뤰坎쳩옛쵌瞿}쒔렜쑈뤰坎쳩옛쵌瞿{^ 111011001001001010010100111010111010000010011100111011001001000110001000111010111010010010110000111001011001110110001110111011001011001110101001111011001001100010011011111011001011010110001100111001111001111010111111011111011110110010010010100101001110101110100000100111001110110010010001100010001110101110100100101100001110010110011101100011101110110010110011101010011110110010011000100110111110110010110101100011001110011110011110101111110111101101011110 ec9294eba09cec9188eba4b0e59d8eecb3a9ec989becb58ce79ebf7dec9294eba09cec9188eba4b0e59d8eecb3a9ec989becb58ce79ebf7b5e
UHC 쒔렜쑈뤰坎쳩옛쵌瞿}쒔렜쑈뤰坎쳩옛쵌瞿{^ 101111101010110110001110101011101011111010100100100011111101111011001010111011001010101110001110101111111011111010101100100011101100111110111010011111011011111010101101100011101010111010111110101001001000111111011110110010101110110010101011100011101011111110111110101011001000111011001111101110100111101101011110 bead8eaebea48fdecaecab8ebfbeac8ecfba7dbead8eaebea48fdecaecab8ebfbeac8ecfba7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
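
The table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The function names (encode_row, encoding_table) are hypothetical, and the mapping from the table's charset labels to Python codec names is an assumption: SJIS-WIN is taken to be cp932 and UHC is taken to be cp949. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encoding_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encoding_table("{^").items():
        print(label, bits, hexstr)
```

Decoding the replaced bytes back with the same codec shows why the "Character" column contains question marks for charsets that lack the input characters.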