To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓?????矣?き語ζ?醫??孃る?儒 11101000101111010011111100111111001111110011111100111111111000011110000100111111100000101010101110001100111010101000001111000100001111111110011111001110001111110011111110011011011011111000001011101001001111111000111011110010 e8bd3f3f3f3f3fe1e13f82ab8cea83c43fe7ce3f3f9b6f82e93f8ef2
EUC-JP 霓?????矣?き語ζ?醫??孃る?儒 11110000101111110011111100111111001111110011111100111111111000101110001100111111101001001010110110111000111011001010011011000110001111111110111011010000001111110011111111010101110100001010010011101011001111111011110011110100 f0bf3f3f3f3f3fe2e33fa4adb8eca6c63feed03f3fd5d0a4eb3fbcf4
UTF-8 霓낅뜄璘⒵뤃矣낅き語ζ략醫덇뎃孃る뿭儒 1110100110011100100100111110101110000010100001011110101110011100100001001110111110100111101011111110001010010010101101011110101110100100100000111110011110011111101000111110101110000010100001011110001110000001100011011110100010101010100111101100111010110110111010111001111010110101111010011000011010101011111010111000110110000111111010111000111010000011111001011010110110000011111000111000001010001011111010111011111110101101111001011000010010010010 e99c93eb8285eb9c84efa7afe292b5eba483e79fa3eb8285e3818de8aa9eceb6eb9eb5e986abeb8d87eb8e83e5ad83e3828bebbfade58492
UHC 霓낅뜄璘⒵뤃矣낅き語ζ략醫덇뎃孃る뿭儒 1110011111100111100001011110101110001101100010001110110011011110101010011110011010001111101101001110101111111000100001011110101110101010101011011110010111011110101001011110011010110111101010111110110010100010100010001110101010110101101010111110010110111110101010101110101110010111101011011110101011100011 e7e785eb8d88ecdea9e68fb4ebf885ebaaade5dea5e6b7abeca288eab5abe5beaaeb97adeae3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)