To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???爰??音??}???爰??音??{^ 00111111001111110011111111100000101001110011111100111111100010011011100100111111001111110111110100111111001111110011111111100000101001110011111100111111100010011011100100111111001111110111101101011110 3f3f3fe0a73f3f89b93f3f7d3f3f3fe0a73f3f89b93f3f7b5e
EUC-JP ???爰??音??}???爰??音??{^ 00111111001111110011111111100000101010010011111100111111101100101011101100111111001111110111110100111111001111110011111111100000101010010011111100111111101100101011101100111111001111110111101101011110 3f3f3fe0a93f3fb2bb3f3f7d3f3f3fe0a93f3fb2bb3f3f7b5e
UTF-8 料겸뫁爰뤸략音깃퐦}料겸뫁爰뤸략音깃퐦{^ 111011111010011010111110111010101011001010111000111010111010101110000001111001111000100010110000111010111010010010111000111010111001111010110101111010011001111110110011111010101011100110000011111011011001000010100110011111011110111110100110101111101110101010110010101110001110101110101011100000011110011110001000101100001110101110100100101110001110101110011110101101011110100110011111101100111110101010111001100000111110110110010000101001100111101101011110 efa6beeab2b8ebab81e788b0eba4b8eb9eb5e99fb3eab983ed90a67defa6beeab2b8ebab81e788b0eba4b8eb9eb5e99fb3eab983ed90a67b5e
UHC 料겸뫁爰뤸략音깃퐦}料겸뫁爰뤸략音깃퐦{^ 111010001111011110110000111000101001000110100101111010101011101010001111111001101011011110101011111010111110010110110001111010101011110110001111011111011110100011110111101100001110001010010001101001011110101010111010100011111110011010110111101010111110101111100101101100011110101010111101100011110111101101011110 e8f7b0e291a5eaba8fe6b7abebe5b1eabd8f7de8f7b0e291a5eaba8fe6b7abebe5b1eabd8f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)