To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 枳???風?胡棒??枳???風?胡棒??E 1001111001101011001111110011111100111111100101011001011100111111100011001101001110010110010111110011111100111111100111100110101100111111001111110011111110010101100101110011111110001100110100111001011001011111001111110011111101000101 9e6b3f3f3f95973f8cd3965f3f3f9e6b3f3f3f95973f8cd3965f3f3f45
EUC-JP 枳?雩?風?胡棒??枳?雩?風?胡棒??E 110110111100110000111111100011111110011011111010001111111100100111110111001111111011100011010101110010111100000000111111001111111101101111001100001111111000111111100110111110100011111111001001111101110011111110111000110101011100101111000000001111110011111101000101 dbcc3f8fe6fa3fc9f73fb8d5cbc03f3fdbcc3f8fe6fa3fc9f73fb8d5cbc03f3f45
UTF-8 枳렟雩렮風렩胡棒렲뻠枳렟雩렮風렩胡棒렲뼙E 11100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001101000101010100011101011101000001010100111101000100000111010000111100110101000111001001011101011101000001011001011101011101110111010000011100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001101000101010100011101011101000001010100111101000100000111010000111100110101000111001001011101011101000001011001011101011101111001001100101000101 e69eb3eba09fe99ba9eba0aee9a2a8eba0a9e883a1e6a392eba0b2ebbba0e69eb3eba09fe99ba9eba0aee9a2a8eba0a9e883a1e6a392eba0b2ebbc9945
UHC 枳렟雩렮風렩胡棒렲뻠枳렟雩렮風렩胡棒렲뼙E 1111001010101100100011101011000011101001111011001000111010111011111110011010011010001110101101111111101111010111110111001110101010001110101111111011101110111010111100101010110010001110101100001110100111101100100011101011101111111001101001101000111010110111111110111101011111011100111010101000111010111111101110111100001101000101 f2ac8eb0e9ec8ebbf9a68eb7fbd7dcea8ebfbbbaf2ac8eb0e9ec8ebbf9a68eb7fbd7dcea8ebfbbc345

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)