What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄??乳??邑??繹??陰??柔ル?沃??楢 10010110111011110011111100111111100100111111101100111111001111111001011101010111001111110011111111100011100010000011111100111111100010010100000100111111001111111000111101011111100000111000101100111111100101111000000000111111001111111001001111101000 96ef3f3f93fb3f3f97573f3fe3883f3f89413f3f8f5f838b3f97803f3f93e8
EUC-JP 厄??乳??邑??繹??陰??柔ル?沃??楢 11001100111100010011111100111111110001101111110100111111001111111100110110111000001111110011111111100101111010000011111100111111101100011010001000111111001111111011110111000000101001011110101100111111110011011110000000111111001111111100011011101010 ccf13f3fc6fd3f3fcdb83f3fe5e83f3fb1a23f3fbdc0a5eb3fcde03f3fc6ea
UTF-8 厄닿낮乳뀐쭫邑룹춷繹먮냱陰뉒춯柔ル쿋沃쇈렖楢 111001011000111010000100111010111000101110111111111010111000001010101110111001001011100110110011111010111000000010010000111011001010110110101011111010011000001010010001111010111010001110111001111011001011011010110111111001111011100110111001111010111010100010101110111010111000001110110001111010011001100110110000111010111000100110010010111011001011011010101111111001101001111110010100111000111000001110101011111011001011111110001011111001101011001010000011111011001000011110001000111010111010000010010110111001101010010110100010 e58e84eb8bbfeb82aee4b9b3eb8090ecadabe98291eba3b9ecb6b7e7b9b9eba8aeeb83b1e999b0eb8992ecb6afe69f94e383abecbf8be6b283ec8788eba096e6a5a2
UHC 厄닿낮乳뀐쭫邑룹춷繹먮냱陰뉒춯柔ル쿋沃쇈렖楢 1110010011111000101101001110101010110011101101111110101011100001101100101110111110100111100111111110101111101001101101111110110010101101100100111110011010111010100100001110101110000110100000011110101111100100100001111110011110101101100011001110101011110101101010111110101110110010101000001110100010101010101111001110001110001110101010111110101011111001 e4f8b4eab3b7eae1b2efa79febe9b7ecad93e6ba90eb8681ebe487e7ad8ceaf5abebb2a0e8aabce38eabeaf9

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
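
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def convert(text):
    """Build one row per charset, keyed by the table's charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexed) in convert("厄").items():
        print(label, shown, binary, hexed)
```

For the single character 厄, this yields e58e84 in UTF-8, 96ef in SJIS-WIN, and ccf1 in EUC-JP, the same leading bytes as the corresponding rows of the table.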