To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??認??????ワ????幽??秧??苑? 1110001010100011001111110011111110010100010001100011111100111111001111110011111100111111001111111000001110001111001111110011111100111111001111111001011101001000001111110011111111100010010111100011111100111111100010011001000100111111 e2a33f3f94463f3f3f3f3f3f838f3f3f3f3f97483f3fe25e3f3f89913f
EUC-JP 筌??認??洧???ワ????幽??秧??苑? 11100100101001010011111100111111110001111010011100111111001111111000111111000111101101000011111100111111001111111010010111101111001111110011111100111111001111111100110110101001001111110011111111100011101111110011111100111111101100011111000100111111 e4a53f3fc7a73f3f8fc7b43f3f3fa5ef3f3f3f3fcda93f3fe3bf3f3fb1f13f
UTF-8 筌뚮뱷認뗰ℓ洧밸쨨曆ワ퐢栒끾삌幽됲꺂秧덈똻苑볿 111001111010110110001100111010111001101010101110111010111011000110110111111010001010101010001101111010111001011110110000111000101000010010010011111001101011010010100111111010111011000010111000111011001010100010101000111011111010011010001011111000111000001110101111111011011001000010100010111001101010000010010010111010111000000110111110111011001000001010001100111001011011100110111101111010111001000010110010111010101011101010000010111001111010011110100111111010111000110110001000111010111001100010111011111010001000101110010001111010111011001110111111 e7ad8ceb9aaeebb1b7e8aa8deb97b0e28493e6b4a7ebb0b8eca8a8efa68be383afed90a2e6a092eb81beec828ce5b9bdeb90b2eaba82e7a7a7eb8d88eb98bbe88b91ebb3bf
UHC 筌뚮뱷認뗰ℓ洧밸쨨曆ワ퐢栒끾삌幽됲꺂秧덈똻苑볿 11101111101001111000110011101011100100111001110111101100111000111000101111101111101001111010010011101010111110111011100111101011101001001000001111100110101101111010101111101111101111011000101111100010111000111000010111100110100110001001001111101010111010111000100111101101100000111010101111100100111010111000100011101011100011001000000111101010101111011001010001000010 efa78ceb939dece38befa7a4eafbb9eba483e6b7abefbd8be2e385e69893eaeb89ed83abe4eb88eb8c81eabd9442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)