To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??踰??邑る?語??循??誘??娃 100110000110000000111111001111111110011011111010001111110011111110010111010101111000001011101001001111111000110011101010001111110011111110001111011110100011111100111111100101110101010100111111001111111000100010100001 98603f3fe6fa3f3f975782e93f8cea3f3f8f7a3f3f97553f3f88a1
EUC-JP 倭??踰??邑る?語??循??誘??娃 110011111100000100111111001111111110110011111100001111110011111111001101101110001010010011101011001111111011100011101100001111110011111110111101110110110011111100111111110011011011011000111111001111111011000010100011 cfc13f3fecfc3f3fcdb8a4eb3fb8ec3f3fbddb3f3fcdb63f3fb0a3
UTF-8 倭녾낮踰졿뤃邑る뀆語ⓥ뫗循욕짃誘⑸옲娃 111001011000000010101101111010111000010110111110111010111000001010101110111010001011100010110000111011001010000110111111111010111010010010000011111010011000001010010001111000111000001010001011111010111000000010000110111010001010101010011110111000101001001110100101111010111010101110010111111001011011111010101010111011001001101010010101111011001010011110000011111010001010101010011000111000101001000110111000111011001001100010110010111001011010100010000011 e580adeb85beeb82aee8b8b0eca1bfeba483e98291e3828beb8086e8aa9ee293a5ebab97e5beaaec9a95eca783e8aa98e291b8ec98b2e5a883
UHC 倭녾낮踰졿뤃邑る뀆語ⓥ뫗循욕짃誘⑸옲娃 1110100011011110100001101110101010110011101101111110101110110010101000001110011010001111101101001110101111101001101010101110101110000101100000101110010111011110101010001110001010010001101110011110001011100000101111111110010110100011100100111110101110101111101010011110101110011110101011011110100011011111 e8de86eab3b7ebb2a0e68fb4ebe9aaeb8582e5dea8e291b9e2e0bfe5a393ebafa9eb9eade8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)