What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌≪?泣??碎κ?筌l?媛??碎γ?筌l?? 11100010101000111000000111100001001111111000101110000011001111110011111111100001111010101000001111001000001111111110001010100011100000101000110000111111100101010101000100111111001111111110000111101010100000111100000100111111111000101010001110000010100011000011111100111111 e2a381e13f8b833f3fe1ea83c83fe2a3828c3f95513f3fe1ea83c13fe2a3828c3f3f
EUC-JP 筌≪?泣??碎κ?筌l?媛??碎γ?筌l?? 11100100101001011010001011100011001111111011010111100011001111110011111111100010111011001010011011001010001111111110010010100101101000111110110000111111110010011011001000111111001111111110001011101100101001101100001100111111111001001010010110100011111011000011111100111111 e4a5a2e33fb5e33f3fe2eca6ca3fe4a5a3ec3fc9b23f3fe2eca6c33fe4a5a3ec3f3f
UTF-8 筌≪눛泣숋㎖碎κ갸筌l쥙媛뱄㎗碎γ럾筌l뮇李 11100111101011011000110011100010100010011010101011101011100010001001101111100110101100111010001111101100100010001000101111100011100011101001011011100111101000101000111011001110101110101110101010110000101110001110011110101101100011001110111110111101100011001110110010100101100110011110010110101010100110111110101110110001100001001110001110001110100101111110011110100010100011101100111010110011111010111001111110111110111001111010110110001100111011111011110110001100111010111010111010000111111011111010011110100001 e7ad8ce289aaeb889be6b3a3ec888be38e96e7a28ecebaeab0b8e7ad8cefbd8ceca599e5aa9bebb184e38e97e7a28eceb3eb9fbee7ad8cefbd8cebae87efa7a1
UHC 筌≪눛泣숋㎖碎κ갸筌l쥙媛뱄㎗碎γ럾筌l뮇李 1110111110100111101000011110110010000111101100111110101111101000100110011110111110100111101000101110000111101111101001011110101010110000101111001110111110100111101000111110110010100010100011101110101010110000101110011110111110100111101000111110000111101111101001011110001110001110100110111110111110100111101000111110110010010010100101101110110010110000 efa7a1ec87b3ebe899efa7a2e1efa5eab0bcefa7a3eca28eeab0b9efa7a3e1efa5e38e9befa7a3ec9296ecb0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
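
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the mapping from charset labels to Python codec names (for example "cp932" for SJIS-WIN and "cp949" for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect the bit strings."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.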