To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 悅??節g?炎?n}悅??節g?炎?n{^ 1111101010111101001111110011111110010000110111111000001010000111001111111000100110001010001111110110111001111101111110101011110100111111001111111001000011011111100000101000011100111111100010011000101000111111011011100111101101011110 fabd3f3f90df82873f898a3f6e7dfabd3f3f90df82873f898a3f6e7b5e
EUC-JP ???節g?炎?n}???節g?炎?n{^ 001111110011111100111111110000001110000110100011111001110011111110110001111010100011111101101110011111010011111100111111001111111100000011100001101000111110011100111111101100011110101000111111011011100111101101011110 3f3f3fc0e1a3e73fb1ea3f6e7d3f3f3fc0e1a3e73fb1ea3f6e7b5e
UTF-8 悅롳풏節g츕炎챣n}悅롳풏節g츕炎챣n{^ 1110011010000010100001011110101110100001101100111110110110010010100011111110011110101111100000001110111110111101100001111110110010111000100101011110011110000010100011101110110010110001101000110110111001111101111001101000001010000101111010111010000110110011111011011001001010001111111001111010111110000000111011111011110110000111111011001011100010010101111001111000001010001110111011001011000110100011011011100111101101011110 e68285eba1b3ed928fe7af80efbd87ecb895e7828eecb1a36e7de68285eba1b3ed928fe7af80efbd87ecb895e7828eecb1a36e7b5e
UHC 悅롳풏節g츕炎챣n}悅롳풏節g츕炎챣n{^ 11100110111011011000111011101111101111101001001111101111101111011010001111100111101011101000111111100110111110101010101001101111011011100111110111100110111011011000111011101111101111101001001111101111101111011010001111100111101011101000111111100110111110101010101001101111011011100111101101011110 e6ed8eefbe93efbda3e7ae8fe6faaa6f6e7de6ed8eefbe93efbda3e7ae8fe6faaa6f6e7b5e
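A table like the one above can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: the function names `to_bits` and `conversion_table` are hypothetical, and the mapping from the charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Python's codecs may differ from the converter's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table shows.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Return one (label, binary, hex) row per charset in CHARSETS."""
    rows = []
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bits(text, codec)
        rows.append((label, binary, hexadecimal))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("n{^"):
        print(label, binary, hexadecimal)
```

For the pure-ASCII input `n{^`, every charset listed yields the same bytes, `6e7b5e`, which matches the last three bytes of each row in the table. Non-ASCII input is where the charsets diverge.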

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).