To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??宥??孃??泣??爰⑤?揄??辱⑤?孺 1000100101101001001111110011111110010111010001110011111100111111100110110110111100111111001111111000101110000011001111110011111111100000101001111000011101000100001111111001110110001001001111110011111110010000010010101000011101000100001111111001101101111101 89693f3f97473f3f9b6f3f3f8b833f3fe0a787443f9d893f3f904a87443f9b7d
EUC-JP 永??宥??孃??泣??爰??揄??辱??孺 101100011100101000111111001111111100110110101000001111110011111111010101110100000011111100111111101101011110001100111111001111111110000010101001001111110011111111011001111010010011111100111111101111111010101100111111001111111101010111011110 b1ca3f3fcda83f3fd5d03f3fb5e33f3fe0a93f3fd9e93f3fbfab3f3fd5de
UTF-8 永띔벰宥닿쾲孃뉎끏泣당독爰⑤뒾揄먭쾱辱⑤퐦孺 111001101011000010111000111010111001110110010100111010111011001010110000111001011010111010100101111010111000101110111111111011001011111010110010111001011010110110000011111010111000100110001110111010111000000110001111111001101011001110100011111010111000101110111001111010111000111110000101111001111000100010110000111000101001000110100100111010111001001010111110111001101000111110000100111010111010100010101101111011001011111010110001111010001011111010110001111000101001000110100100111011011001000010100110111001011010110110111010 e6b0b8eb9d94ebb2b0e5aea5eb8bbfecbeb2e5ad83eb898eeb818fe6b3a3eb8bb9eb8f85e788b0e291a4eb92bee68f84eba8adecbeb1e8beb1e291a4ed90a6e5adba
UHC 永띔벰宥닿쾲孃뉎끏泣당독爰⑤뒾揄먭쾱辱⑤퐦孺 1110011110110101101101101110101010111010101010001110101011101001101101001110101010110010100010001110010110111110100001111110001110000101101111111110101111101000101101001110011110110101101101101110101010111010101010001110101110001010101101001110101011110001100100001110101010110010100001111110100110110100101010001110101110111101100011111110101011101000 e7b5b6eabaa8eae9b4eab288e5be87e385bfebe8b4e7b5b6eabaa8eb8ab4eaf190eab287e9b4a8ebbd8feae8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)