To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 嶸??項???譯?}v嶸??項???譯?}vB 1111101010110100001111110011111110001101100000000011111100111111001111111110011010100001001111110111110101110110111110101011010000111111001111111000110110000000001111110011111100111111111001101010000100111111011111010111011001000010 fab43f3f8d803f3f3fe6a13f7d76fab43f3f8d803f3f3fe6a13f7d7642
EUC-JP 嶸??項???譯?}v嶸??項???譯?}vB 10001111101110111111010000111111001111111011100111100000001111110011111100111111111011001010001100111111011111010111011010001111101110111111010000111111001111111011100111100000001111110011111100111111111011001010001100111111011111010111011001000010 8fbbf43f3fb9e03f3f3feca33f7d768fbbf43f3fb9e03f3f3feca33f7d7642
UTF-8 嶸뤹윂項싵轢랂譯캟}v嶸뤹윂項싵轢랂譯캟}vB 1110010110110110101110001110101110100100101110011110110010011100100000101110100110100000100001011110110010001011101101011110111110100110100011011110101110011110100000101110100010101101101011111110110010111010100111110111110101110110111001011011011010111000111010111010010010111001111011001001110010000010111010011010000010000101111011001000101110110101111011111010011010001101111010111001111010000010111010001010110110101111111011001011101010011111011111010111011001000010 e5b6b8eba4b9ec9c82e9a085ec8bb5efa68deb9e82e8adafecba9f7d76e5b6b8eba4b9ec9c82e9a085ec8bb5efa68deb9e82e8adafecba9f7d7642
UHC 嶸뤹윂項싵轢랂譯캟}v嶸뤹윂項싵轢랂譯캟}vB 1110011110101110100011111110011110011111100011011111101010100011100110101110111011100110101111001000110111101110111001101011101110110000010001100111110101110110111001111010111010001111111001111001111110001101111110101010001110011010111011101110011010111100100011011110111011100110101110111011000001000110011111010111011001000010 e7ae8fe79f8dfaa39aeee6bc8deee6bbb0467d76e7ae8fe79f8dfaa39aeee6bc8deee6bbb0467d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)