To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??虞?垣企????霽???虞?垣企????霽?^ 001111110011111110001011111100010011111110001010010111111000101011101001001111110011111100111111001111111110100011000111001111110011111100111111100010111111000100111111100010100101111110001010111010010011111100111111001111110011111111101000110001110011111101011110 3f3f8bf13f8a5f8ae93f3f3f3fe8c73f3f3f8bf13f8a5f8ae93f3f3f3fe8c73f5e
EUC-JP ??虞?垣企????霽???虞?垣企????霽?^ 001111110011111110110110111100110011111110110011110000001011010011101011001111110011111100111111001111111111000011001001001111110011111100111111101101101111001100111111101100111100000010110100111010110011111100111111001111110011111111110000110010010011111101011110 3f3fb6f33fb3c0b4eb3f3f3f3ff0c93f3f3fb6f33fb3c0b4eb3f3f3f3ff0c93f5e
UTF-8 亐렕虞렧垣企렟렩罹렗霽렢亐렕虞렧垣企렟렩罹렗霽렢^ 11100100101110101001000011101011101000001001010111101000100110011001111011101011101000001010011111100101100111101010001111100100101111001000000111101011101000001001111111101011101000001010100111101111101001111010011011101011101000001001011111101001100111001011110111101011101000001010001011100100101110101001000011101011101000001001010111101000100110011001111011101011101000001010011111100101100111101010001111100100101111001000000111101011101000001001111111101011101000001010100111101111101001111010011011101011101000001001011111101001100111001011110111101011101000001010001001011110 e4ba90eba095e8999eeba0a7e59ea3e4bc81eba09feba0a9efa7a6eba097e99cbdeba0a2e4ba90eba095e8999eeba0a7e59ea3e4bc81eba09feba0a9efa7a6eba097e99cbdeba0a25e
UHC 亐렕虞렧垣企렟렩罹렗霽렢亐렕虞렧垣企렟렩罹렗霽렢^ 11101010101001111000111010101010111010011110010110001110101101101110101010101111110100001110101010001110101100001000111010110111111011001011101010001110101011001111000010111000100011101011001111101010101001111000111010101010111010011110010110001110101101101110101010101111110100001110101010001110101100001000111010110111111011001011101010001110101011001111000010111000100011101011001101011110 eaa78eaae9e58eb6eaafd0ea8eb08eb7ecba8eacf0b88eb3eaa78eaae9e58eb6eaafd0ea8eb08eb7ecba8eacf0b88eb35e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
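
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every listed charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("虞^").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.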