To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 魚??宜??蹂??冗??筍?┐誘??吟 100010111001101100111111001111111000101101011000001111110011111111100110111110000011111100111111100011111110011100111111001111111110001010100001001111111000010010100010100101110101010100111111001111111000101111100001 8b9b3f3f8b583f3fe6f83f3f8fe73f3fe2a13f84a297553f3f8be1
EUC-JP 魚??宜??蹂??冗??筍?┐誘??吟 101101011111101100111111001111111011010110111001001111110011111111101100111110100011111100111111101111101110100100111111001111111110010010100011001111111010100010100100110011011011011000111111001111111011011011100011 b5fb3f3fb5b93f3fecfa3f3fbee93f3fe4a33fa8a4cdb63f3fb6e3
UTF-8 魚잕랜宜룝슭蹂좊쨨冗밴랜筍잞┐誘띾뿪吟 111010011010110110011010111011001001111010010101111010111001111010011100111001011010111010011100111010111010001110011101111011001000101010101101111010001011100110000010111011001010001010001010111011001010100010101000111001011000011010010111111010111011000010110100111010111001111010011100111001111010110110001101111011001001111010011110111000101001010010010000111010001010101010011000111010111001110110111110111010111011111110101010111001011001000010011111 e9ad9aec9e95eb9e9ce5ae9ceba39dec8aade8b982eca28aeca8a8e58697ebb0b4eb9e9ce7ad8dec9e9ee29490e8aa98eb9dbeebbfaae5909f
UHC 魚잕랜宜룝슭蹂좊쨨冗밴랜筍잞┐誘띾뿪吟 1110010111100000100111111110101010110111101000111110101111110001101101111110010010111101101111101110101110110011101000001110101110100100100000111110100110110111101110011110101010110111101000111110001011101100100111111110111110100110101001001110101110101111100011011110101110010111101010101110101111100001 e5e09feab7a3ebf1b7e4bdbeebb3a0eba483e9b7b9eab7a3e2ec9fefa6a4ebaf8deb97aaebe1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)