To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 筌k?爰??儀??z筌k?爰??儀??zB 1110001010100011100000101000101100111111111000001010011100111111001111111000101101010110001111110011111101111010111000101010001110000010100010110011111111100000101001110011111100111111100010110101011000111111001111110111101001000010 e2a3828b3fe0a73f3f8b563f3f7ae2a3828b3fe0a73f3f8b563f3f7a42
EUC-JP 筌k?爰??儀??z筌k?爰??儀??zB 1110010010100101101000111110101100111111111000001010100100111111001111111011010110110111001111110011111101111010111001001010010110100011111010110011111111100000101010010011111100111111101101011011011100111111001111110111101001000010 e4a5a3eb3fe0a93f3fb5b73f3f7ae4a5a3eb3fe0a93f3fb5b73f3f7a42
UTF-8 筌k똻爰쇽㏊儀쀬뒍z筌k똻爰쇽㏊儀쀬뒍zB 111001111010110110001100111011111011110110001011111010111001100010111011111001111000100010110000111011001000011110111101111000111000111110001010111001011000010010000000111011001000000010101100111010111001001010001101011110101110011110101101100011001110111110111101100010111110101110011000101110111110011110001000101100001110110010000111101111011110001110001111100010101110010110000100100000001110110010000000101011001110101110010010100011010111101001000010 e7ad8cefbd8beb98bbe788b0ec87bde38f8ae58480ec80aceb928d7ae7ad8cefbd8beb98bbe788b0ec87bde38f8ae58480ec80aceb928d7a42
UHC 筌k똻爰쇽㏊儀쀬뒍z筌k똻爰쇽㏊儀쀬뒍zB 111011111010011110100011111010111000110010000001111010101011101010111100111011111010011110110101111010111111000010010111111011001000101010001010011110101110111110100111101000111110101110001100100000011110101010111010101111001110111110100111101101011110101111110000100101111110110010001010100010100111101001000010 efa7a3eb8c81eababcefa7b5ebf097ec8a8a7aefa7a3eb8c81eababcefa7b5ebf097ec8a8a7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)