To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蘂??泣??儀?????蘂??泣??儀?????B 11100101010000010011111100111111100010111000001100111111001111111000101101010110001111110011111100111111001111110011111111100101010000010011111100111111100010111000001100111111001111111000101101010110001111110011111100111111001111110011111101000010 e5413f3f8b833f3f8b563f3f3f3f3fe5413f3f8b833f3f8b563f3f3f3f3f42
EUC-JP 蘂??泣??儀?????蘂??泣??儀?????B 11101001101000100011111100111111101101011110001100111111001111111011010110110111001111110011111100111111001111110011111111101001101000100011111100111111101101011110001100111111001111111011010110110111001111110011111100111111001111110011111101000010 e9a23f3fb5e33f3fb5b73f3f3f3f3fe9a23f3fb5e33f3fb5b73f3f3f3f3f42
UTF-8 蘂띠눖泣곲렟儀륁벢列띕걲蘂띠눖泣곲렟儀륁벢列띕걲B 11101000100110001000001011101011100111011010000011101011100010001001011011100110101100111010001111101010101100111011001011101011101000001001111111100101100001001000000011101011101001011000000111101011101100101010001011101111101001101001110011101011100111011001010111101010101100011011001011101000100110001000001011101011100111011010000011101011100010001001011011100110101100111010001111101010101100111011001011101011101000001001111111100101100001001000000011101011101001011000000111101011101100101010001011101111101001101001110011101011100111011001010111101010101100011011001001000010 e89882eb9da0eb8896e6b3a3eab3b2eba09fe58480eba581ebb2a2efa69ceb9d95eab1b2e89882eb9da0eb8896e6b3a3eab3b2eba09fe58480eba581ebb2a2efa69ceb9d95eab1b242
UHC 蘂띠눖泣곲렟儀륁벢列띕걲蘂띠눖泣곲렟儀륁벢列띕걲B 11100111110111101011011011101100100001111011000011101011111010001000000111101001100011101011000011101011111100001000111111101100100100111011101111100110111010101011011011101011100000011001100111100111110111101011011011101100100001111011000011101011111010001000000111101001100011101011000011101011111100001000111111101100100100111011101111100110111010101011011011101011100000011001100101000010 e7deb6ec87b0ebe881e98eb0ebf08fec93bbe6eab6eb8199e7deb6ec87b0ebe881e98eb0ebf08fec93bbe6eab6eb819942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)