To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??孺??宥??烏?????恂ろ?亦??? 1110101001011111001111110011111110011011011111010011111100111111100101110100011100111111001111111000100101000111001111110011111100111111001111110011111110011100100101101000001011101011001111111001011010010010001111110011111100111111 ea5f3f3f9b7d3f3f97473f3f89473f3f3f3f3f9c9682eb3f96923f3f3f
EUC-JP 鸚??孺??宥??烏??庾??恂ろ?亦??沅 111100111100000000111111001111111101010111011110001111110011111111001101101010000011111100111111101100011010100000111111001111111000111110111100110011100011111100111111110101111111011010100100111011010011111111001011111100100011111100111111100011111100011011101001 f3c03f3fd5de3f3fcda83f3fb1a83f3f8fbcce3f3fd7f6a4ed3fcbf23f3f8fc6e9
UTF-8 鸚쒓퍔孺썹뵳宥몃쨨烏겸뫁庾썲톹恂ろ뮑亦낆쉪沅 111010011011100010011010111011001001001010010011111011011000110110010100111001011010110110111010111011001000110110111001111010111011010110110011111001011010111010100101111010111010101010000011111011001010100010101000111001111000001110001111111010101011001010111000111010111010101110000001111001011011101010111110111011001000110110110010111011011000011010111001111001101000000110000010111000111000001010001101111010111010111010010001111001001011101010100110111010111000001010000110111011001000100110101010111001101011001010000101 e9b89aec9293ed8d94e5adbaec8db9ebb5b3e5aea5ebaa83eca8a8e7838feab2b8ebab81e5babeec8db2ed86b9e68182e3828debae91e4baa6eb8286ec89aae6b285
UHC 鸚쒓퍔孺썹뵳宥몃쨨烏겸뫁庾썲톹恂ろ뮑亦낆쉪沅 1110010110100100100111001110101010111011100010111110101011101000101111011110011110010100101100011110101011101001101110001110101110100100100000111110100010100001101100001110001010010001101001011110101011101100101111011110010110110111100011011110001011100001101010101110110110010010100111011110011010110010100001011110110010011010100001001110101010110110 e5a49ceabb8beae8bde794b1eae9b8eba483e8a1b0e291a5eaecbde5b78de2e1aaed929de6b285ec9a84eab6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)