To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 烏??悅??鴦??}烏??悅??鴦??{^ 100010010100011100111111001111111111101010111101001111110011111111101001111100010011111100111111011111011000100101000111001111110011111111111010101111010011111100111111111010011111000100111111001111110111101101011110 89473f3ffabd3f3fe9f13f3f7d89473f3ffabd3f3fe9f13f3f7b5e
EUC-JP 烏?????鴦??}烏?????鴦??{^ 10110001101010000011111100111111001111110011111100111111111100101111001100111111001111110111110110110001101010000011111100111111001111110011111100111111111100101111001100111111001111110111101101011110 b1a83f3f3f3f3ff2f33f3f7db1a83f3f3f3f3ff2f33f3f7b5e
UTF-8 烏녿젺悅쎈젾鴦잙졁}烏녿젺悅쎈젾鴦잙졁{^ 111001111000001110001111111010111000010110111111111011001010000010111010111001101000001010000101111011001000111010001000111011001010000010111110111010011011010010100110111011001001111010011001111011001010000110000001011111011110011110000011100011111110101110000101101111111110110010100000101110101110011010000010100001011110110010001110100010001110110010100000101111101110100110110100101001101110110010011110100110011110110010100001100000010111101101011110 e7838feb85bfeca0bae68285ec8e88eca0bee9b4a6ec9e99eca1817de7838feb85bfeca0bae68285ec8e88eca0bee9b4a6ec9e99eca1817b5e
UHC 烏녿젺悅쎈젾鴦잙졁}烏녿젺悅쎈젾鴦잙졁{^ 111010001010000110000110111010111010000010101101111001101110110110111101111010111010000010110000111001001110110010011111111010111010000010110010011111011110100010100001100001101110101110100000101011011110011011101101101111011110101110100000101100001110010011101100100111111110101110100000101100100111101101011110 e8a186eba0ade6edbdeba0b0e4ec9feba0b27de8a186eba0ade6edbdeba0b0e4ec9feba0b27b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)