To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠢?苑?珥?湲?邑?蠢?苑?珥?湲??六 111001011011111100111111100010011001000100111111111000001110000000111111100111111101000100111111100101110101011100111111111001011011111100111111100010011001000100111111111000001110000000111111100111111101000100111111001111111001100001011010 e5bf3f89913fe0e03f9fd13f97573fe5bf3f89913fe0e03f9fd13f3f985a
EUC-JP 蠢?苑?珥?湲?邑?蠢?苑?珥?湲?瑗六 1110101011000001001111111011000111110001001111111110000011100010001111111101111011010011001111111100110110111000001111111110101011000001001111111011000111110001001111111110000011100010001111111101111011010011001111111000111111001100110000001100111110111011 eac13fb1f13fe0e23fded33fcdb83feac13fb1f13fe0e23fded33f8fccc0cfbb
UTF-8 蠢렋苑렩珥렖湲렪邑렧蠢렋苑렩珥렖湲렪瑗六 111010001010000010100010111010111010000010001011111010001000101110010001111010111010000010101001111001111000111110100101111010111010000010010110111001101011100110110010111010111010000010101010111010011000001010010001111010111010000010100111111010001010000010100010111010111010000010001011111010001000101110010001111010111010000010101001111001111000111110100101111010111010000010010110111001101011100110110010111010111010000010101010111001111001000110010111111001011000010110101101 e8a0a2eba08be88b91eba0a9e78fa5eba096e6b9b2eba0aae98291eba0a7e8a0a2eba08be88b91eba0a9e78fa5eba096e6b9b2eba0aae79197e585ad
UHC 蠢렋苑렩珥렖湲렪邑렧蠢렋苑렩珥렖湲렪瑗六 11110001111000111000111010100010111010101011110110001110101101111110110010110100100011101010101111101010101110001000111010111000111010111110100110001110101101101111000111100011100011101010001011101010101111011000111010110111111011001011010010001110101010111110101010111000100011101011100011101010101111001101011110111111 f1e38ea2eabd8eb7ecb48eabeab88eb8ebe98eb6f1e38ea2eabd8eb7ecb48eabeab88eb8eabcd7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)