To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????z??????????zB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101111010001111110011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 淏幟貊カ涬会スス魘ォz淏幟貊カ涬会スス魘ォzB 1111101101000010100110111110111011100110101110111011011011111011010000011000100111101111101111011011110111101001101101001010101101111010111110110100001010011011111011101110011010111011101101101111101101000001100010011110111110111101101111011110100110110100101010110111101001000010 fb429beee6bbb6fb4189efbdbde9b4ab7afb429beee6bbb6fb4189efbdbde9b4ab7a42
EUC-JP 淏幟貊カ涬会スス魘ォz淏幟貊カ涬会スス魘ォzB 1000111111000111110110011101011011110000111011001011110110001110101101101000111111000111110011111011001011110001100011101011110110001110101111011111001010110110100011101010101101111010100011111100011111011001110101101111000011101100101111011000111010110110100011111100011111001111101100101111000110001110101111011000111010111101111100101011011010001110101010110111101001000010 8fc7d9d6f0ecbd8eb68fc7cfb2f18ebd8ebdf2b68eab7a8fc7d9d6f0ecbd8eb68fc7cfb2f18ebd8ebdf2b68eab7a42
UTF-8 淏幟貊カ涬会スス魘ォz淏幟貊カ涬会スス魘ォzB 111001101011011110001111111001011011100110011111111010001011001010001010111011111011110110110110111001101011011010101100111001001011110010011010111011111011110110111101111011111011110110111101111010011010110110011000111011111011110110101011011110101110011010110111100011111110010110111001100111111110100010110010100010101110111110111101101101101110011010110110101011001110010010111100100110101110111110111101101111011110111110111101101111011110100110101101100110001110111110111101101010110111101001000010 e6b78fe5b99fe8b28aefbdb6e6b6ace4bc9aefbdbdefbdbde9ad98efbdab7ae6b78fe5b99fe8b28aefbdb6e6b6ace4bc9aefbdbdefbdbde9ad98efbdab7a42
UHC 淏幟貊???????z淏幟貊???????zB 1111101111001000111101101011101011011000111001110011111100111111001111110011111100111111001111110011111101111010111110111100100011110110101110101101100011100111001111110011111100111111001111110011111100111111001111110111101001000010 fbc8f6bad8e73f3f3f3f3f3f3f7afbc8f6bad8e73f3f3f3f3f3f3f7a42

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
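
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the function names `to_bit_strings` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is the pattern visible in the ISO-8859-1 and UHC rows.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec
    # cannot represent.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("zB").items():
        print(label, binary, hexadecimal)
```

ASCII characters such as "z" and "B" encode to the same bytes (7a and 42) in all five charsets, which is why every row of the table ends in 7a42. For other characters, the bytes produced by Python's codecs may differ from the table where implementations disagree on vendor extensions, so this sketch is not guaranteed to reproduce every row.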