What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 魚??揖??蹂??壓??艤??猷??鵝??幼 100010111001101100111111001111111001011101001011001111110011111111100110111110000011111100111111100110101101100000111111001111111110010001111110001111110011111110010111010100010011111100111111111010100100000000111111001111111001011101100011 8b9b3f3f974b3f3fe6f83f3f9ad83f3fe47e3f3f97513f3fea403f3f9763
EUC-JP 魚??揖??蹂??壓??艤??猷??鵝??幼 101101011111101100111111001111111100110110101100001111110011111111101100111110100011111100111111110101001101101000111111001111111110011111011111001111110011111111001101101100100011111100111111111100111010000100111111001111111100110111000100 b5fb3f3fcdac3f3fecfa3f3fd4da3f3fe7df3f3fcdb23f3ff3a13f3fcdc4
UTF-8 魚잙쉴揖겻슭蹂좎젘壓믩쓹艤븝쬅猷밸퉾鵝얠떘幼 111010011010110110011010111011001001111010011001111011001000100110110100111001101000111110010110111010101011001010111011111011001000101010101101111010001011100110000010111011001010001010001110111011001010000010011000111001011010001110010011111010111010111110101001111011001001001110111001111010001000100110100100111010111011100010011101111011001010110010000101111001111000110010110111111010111011000010111000111011011000100110111110111010011011010110011101111011001001011010100000111010111001011010011000111001011011100110111100 e9ad9aec9e99ec89b4e68f96eab2bbec8aade8b982eca28eeca098e5a393ebafa9ec93b9e889a4ebb89decac85e78cb7ebb0b8ed89bee9b59dec96a0eb9698e5b9bc
UHC 魚잙쉴揖겻슭蹂좎젘壓믩쓹艤븝쬅猷밸퉾鵝얠떘幼 1110010111100000100111111110101110111101101011111110101111100111101100001110010010111101101111101110101110110011101000001110110010100000100101001110010011100010100100101110101110011101100101011110101111111010101110101110111110100110100111001110101110100011101110011110101110111001100101101110010010111101101111101110110010001011101011101110101011101010 e5e09febbdafebe7b0e4bdbeebb3a0eca094e4e292eb9d95ebfabaefa69ceba3b9ebb996e4bdbeec8baeeaea

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
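
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table, though the page's own converter may differ in edge cases.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("魚").items():
        print(label, binary, hexadecimal)
```

For the single character 魚, this sketch yields `e9ad9a` for UTF-8, `8b9b` for SJIS-Win, and `b5fb` for EUC-JP, matching the leading bytes of the corresponding rows above.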