To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鄒ケ闖ス貂会セ懶セ孥鄒ケ闖ス貂会セ懶セ孥B 111001111011111010111001111010001000111110111101111001101011100010001001111011111011111010011100111011111011111010011011011101101110011110111110101110011110100010001111101111011110011010111000100010011110111110111110100111001110111110111110100110110111011001000010 e7beb9e88fbde6b889efbe9cefbe9b76e7beb9e88fbde6b889efbe9cefbe9b7642
EUC-JP 鄒ケ闖ス貂会セ懶セ孥鄒ケ闖ス貂会セ懶セ孥B 1110111011000000100011101011100111101111111011111000111010111101111011001011101010110010111100011000111010111110110110001111000110001110101111101101010111010111111011101100000010001110101110011110111111101111100011101011110111101100101110101011001011110001100011101011111011011000111100011000111010111110110101011101011101000010 eec08eb9efef8ebdecbab2f18ebed8f18ebed5d7eec08eb9efef8ebdecbab2f18ebed8f18ebed5d742
UTF-8 鄒ケ闖ス貂会セ懶セ孥鄒ケ闖ス貂会セ懶セ孥B 11101001100001001001001011101111101111011011100111101001100101111001011011101111101111011011110111101000101100101000001011100100101111001001101011101111101111011011111011100110100001111011011011101111101111011011111011100101101011011010010111101001100001001001001011101111101111011011100111101001100101111001011011101111101111011011110111101000101100101000001011100100101111001001101011101111101111011011111011100110100001111011011011101111101111011011111011100101101011011010010101000010 e98492efbdb9e99796efbdbde8b282e4bc9aefbdbee687b6efbdbee5ada5e98492efbdb9e99796efbdbde8b282e4bc9aefbdbee687b6efbdbee5ada542
UHC 鄒?闖?貂??懶??鄒?闖?貂??懶??B 1111010111011011001111111111011111100110001111111111010110110000001111110011111111010100111110110011111100111111111101011101101100111111111101111110011000111111111101011011000000111111001111111101010011111011001111110011111101000010 f5db3ff7e63ff5b03f3fd4fb3f3ff5db3ff7e63ff5b03f3fd4fb3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)