To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 濡??濡?┸??n}濡??濡?┸??n{^ 100101000100011100111111001111111001010001000111001111111000010010111101001111110011111101101110011111011001010001000111001111110011111110010100010001110011111110000100101111010011111100111111011011100111101101011110 94473f3f94473f84bd3f3f6e7d94473f3f94473f84bd3f3f6e7b5e
EUC-JP 濡??濡?┸繇?n}濡??濡?┸繇?n{^ 11000111101010000011111100111111110001111010100000111111101010001011111110001111110101001101000100111111011011100111110111000111101010000011111100111111110001111010100000111111101010001011111110001111110101001101000100111111011011100111101101011110 c7a83f3fc7a83fa8bf8fd4d13f6e7dc7a83f3fc7a83fa8bf8fd4d13f6e7b5e
UTF-8 濡싲젘濡섓┸繇푏n}濡싲젘濡섓┸繇푏n{^ 1110011010111111101000011110110010001011101100101110110010100000100110001110011010111111101000011110110010000100100100111110001010010100101110001110011110111001100001111110110110010001100011110110111001111101111001101011111110100001111011001000101110110010111011001010000010011000111001101011111110100001111011001000010010010011111000101001010010111000111001111011100110000111111011011001000110001111011011100111101101011110 e6bfa1ec8bb2eca098e6bfa1ec8493e294b8e7b987ed918f6e7de6bfa1ec8bb2eca098e6bfa1ec8493e294b8e7b987ed918f6e7b5e
UHC 濡싲젘濡섓┸繇푏n}濡싲젘濡섓┸繇푏n{^ 11101011101000011001101011101011101000001001010011101011101000011001100011101111101001101011111111101001101000111011111001010110011011100111110111101011101000011001101011101011101000001001010011101011101000011001100011101111101001101011111111101001101000111011111001010110011011100111101101011110 eba19aeba094eba198efa6bfe9a3be566e7deba19aeba094eba198efa6bfe9a3be566e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)