To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???巡??淫??閻??乳??誘れ??ы? 00111111001111110011111110001111100001000011111100111111100010001111101000111111001111111110100010000101001111110011111110010011111110110011111100111111100101110101010110000010111010100011111100111111100001001000110100111111 3f3f3f8f843f3f88fa3f3fe8853f3f93fb3f3f975582ea3f3f848d3f
EUC-JP ???巡??淫??閻??乳??誘れ??ы? 00111111001111110011111110111101111001000011111100111111101100001111110000111111001111111110111111100101001111110011111111000110111111010011111100111111110011011011011010100100111011000011111100111111101001111110110100111111 3f3f3fbde43f3fb0fc3f3fefe53f3fc6fd3f3fcdb6a4ec3f3fa7ed3f
UTF-8 捻뀀갭巡껓쭪淫딅돎閻롡돦乳븝쬆誘れ삖嶪ы뇫 1110111110100110101001001110101110000000100000001110101010110000101011011110010110110111101000011110101010111011100100111110110010101101101010101110011010110111101010111110101110010100100001011110101110001111100011101110100110010110101110111110101110100001101000011110101110001111101001101110010010111001101100111110101110111000100111011110110010101100100001101110100010101010100110001110001110000010100011001110110010000010100101101110010110110110101010101101000110001011111010111000011110101011 efa6a4eb8080eab0ade5b7a1eabb93ecadaae6b7abeb9485eb8f8ee996bbeba1a1eb8fa6e4b9b3ebb89decac86e8aa98e3828cec8296e5b6aad18beb87ab
UHC 捻뀀갭巡껓쭪淫딅돎閻롡돦乳븝쬆誘れ삖嶪ы뇫 111001101111011110110010111010111011000010111000111000101101111010000011111011111010011110011110111010111110001010001010111010111011010110111010111001111010001010001110111000101000100110101010111010101110000110111010111011111010011010011101111010111010111110101010111011001001100010011010111001011111010110101100111011011000011110010001 e6f7b2ebb0b8e2de83efa79eebe28aebb5bae7a28ee289aaeae1baefa69debafaaec989ae5f5aced8791

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)