To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌b?筍?????筌≪?泣??碎γ?筌 11100010101000111000001010000010001111111110001010100001001111110011111100111111001111110011111111100010101000111000000111100001001111111000101110000011001111110011111111100001111010101000001111000001001111111110001010100011 e2a382823fe2a13f3f3f3f3fe2a381e13f8b833f3fe1ea83c13fe2a3
EUC-JP 筌b?筍?????筌≪?泣??碎γ?筌 11100100101001011010001111100010001111111110010010100011001111110011111100111111001111110011111111100100101001011010001011100011001111111011010111100011001111110011111111100010111011001010011011000011001111111110010010100101 e4a5a3e23fe4a33f3f3f3f3fe4a5a2e33fb5e33f3fe2eca6c33fe4a5
UTF-8 筌b뫁筍쏉㏊戮곗숯筌≪눛泣쀯㎗碎γ럾筌 1110011110101101100011001110111110111101100000101110101110101011100000011110011110101101100011011110110010001111100010011110001110001111100010101110111110100111100100101110101010110011100101111110110010001000101011111110011110101101100011001110001010001001101010101110101110001000100110111110011010110011101000111110110010000000101011111110001110001110100101111110011110100010100011101100111010110011111010111001111110111110111001111010110110001100 e7ad8cefbd82ebab81e7ad8dec8f89e38f8aefa792eab397ec88afe7ad8ce289aaeb889be6b3a3ec80afe38e97e7a28eceb3eb9fbee7ad8c
UHC 筌b뫁筍쏉㏊戮곗숯筌≪눛泣쀯㎗碎γ럾筌 1110111110100111101000111110001010010001101001011110001011101100100110111110111110100111101101011110101110111101101100001110110010111101101000011110111110100111101000011110110010000111101100111110101111101000100101111110111110100111101000111110000111101111101001011110001110001110100110111110111110100111 efa7a3e291a5e2ec9befa7b5ebbdb0ecbda1efa7a1ec87b3ebe897efa7a3e1efa5e38e9befa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)