To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 崖ル????凹?????窈?ぉ???倭〓?? 1000101001010010100000111000101100111111001111110011111100111111100010011001101000111111001111110011111100111111001111111110001001110111001111111000001010100111001111110011111100111111100110000110000010000001101011000011111100111111 8a52838b3f3f3f3f899a3f3f3f3f3fe2773f82a73f3f3f986081ac3f3f
EUC-JP 崖ル????凹?????窈?ぉ???倭〓?? 1011001110110011101001011110101100111111001111110011111100111111101100011111101000111111001111110011111100111111001111111110001111011000001111111010010010101001001111110011111100111111110011111100000110100010101011100011111100111111 b3b3a5eb3f3f3f3fb1fa3f3f3f3f3fe3d83fa4a93f3f3fcfc1a2ae3f3f
UTF-8 崖ル씟溜곕젷凹좊뙎溜곕젨窈뚮ぉ溜곕젾倭〓젔力 111001011011010010010110111000111000001110101011111011001001010010011111111011111010011110001011111010101011001110010101111011001010000010110111111001011000011110111001111011001010001010001010111010111001100110001110111011111010011110001011111010101011001110010101111011001010000010101000111001111010101010001000111010111001101010101110111000111000000110001001111011111010011110001011111010101011001110010101111011001010000010111110111001011000000010101101111000111000000010010011111011001010000010010100111011111010011010001010 e5b496e383abec949fefa78beab395eca0b7e587b9eca28aeb998eefa78beab395eca0a8e7aa88eb9aaee38189efa78beab395eca0bee580ade38093eca094efa68a
UHC 崖ル씟溜곕젷凹좊뙎溜곕젨窈뚮ぉ溜곕젾倭〓젔力 1110010011110000101010111110101110011101101100111110101011111110101100001110101110100000101010111110100011101010101000001110101110001100100100111110101011111110101100001110101110100000101000001110100110100001100011001110101110101010101010011110101011111110101100001110101110100000101100001110100011011110101000011110101110100000100100101110011010110011 e4f0abeb9db3eafeb0eba0abe8eaa0eb8c93eafeb0eba0a0e9a18cebaaa9eafeb0eba0b0e8dea1eba092e6b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)