To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍??肉f?矣??如???耶??韋?ぜ幽 1110100010110100001111110011111110010011111101111000001010000110001111111110000111100001001111110011111110010100010000000011111100111111001111111001011011101011001111110011111111101000111010000011111110000010101110101001011101001000 e8b43f3f93f782863fe1e13f3f94403f3f3f96eb3f3fe8e83f82ba9748
EUC-JP 雍??肉f?矣??如???耶??韋?ぜ幽 1111000010110110001111110011111111000110111110011010001111100110001111111110001011100011001111110011111111000111101000010011111100111111001111111100110011101101001111110011111111110000111010100011111110100100101111001100110110101001 f0b63f3fc6f9a3e63fe2e33f3fc7a13f3f3fcced3f3ff0ea3fa4bccda9
UTF-8 雍우궠肉f뤃矣묒춷如붞쇽폁耶껁굥韋뤺ぜ幽 111010011001101110001101111011001001101010110000111010101011011010100000111010001000001010001001111011111011110110000110111010111010010010000011111001111001111110100011111010111010110010010010111011001011011010110111111001011010011010000010111010111011011010011110111011001000011110111101111011011000111110000001111010001000000010110110111010101011101110000001111010101011010110100101111010011001111110001011111010111010010010111010111000111000000110011100111001011011100110111101 e99b8dec9ab0eab6a0e88289efbd86eba483e79fa3ebac92ecb6b7e5a682ebb69eec87bded8f81e880b6eabb81eab5a5e99f8beba4bae3819ce5b9bd
UHC 雍우궠肉f뤃矣묒춷如붞쇽폁耶껁굥韋뤺ぜ幽 11101000101111001011111111101100100000101011001111101011101111111010001111100110100011111011010011101011111110001001000111101100101011011001001111100101111111011001010011001110101111001110111110111100100100001110010110101101100000111110001110000010100010111110101011011111100011111110100010101010101111001110101011101011 e8bcbfec82b3ebbfa3e68fb4ebf891ecad93e5fd94cebcefbc90e5ad83e3828beadf8fe8aabceaeb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)