What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??而??飮?????異わ?恂ル?億 100010010101000100111111001111111000111010100111001111110011111110011111010110100011111100111111001111110011111100111111100010001101100110000010111011010011111110011100100101101000001110001011001111111000100110101101 89513f3f8ea73f3f9f5a3f3f3f3f3f88d982ed3f9c96838b3f89ad
EUC-JP 渦??而??飮?????異わ?恂ル?億 101100011011001000111111001111111011110010101001001111110011111111011101101110110011111100111111001111110011111100111111101100001101101110100100111011110011111111010111111101101010010111101011001111111011001010101111 b1b23f3fbca93f3fddbb3f3f3f3f3fb0dba4ef3fd7f6a5eb3fb2af
UTF-8 渦기뫀而얍뭡飮곷겱亮쎈슣異わ쫳恂ル굦億 111001101011100010100110111010101011100010110000111010111010101110000000111010001000000010001100111011001001011010001101111010111010110110100001111010011010001110101110111010101011001110110111111010101011001010110001111011111010010110110111111011001000111010001000111011001000101010100011111001111001010110110000111000111000001010001111111011001010101110110011111001101000000110000010111000111000001110101011111010101011010110100110111001011000010010000100 e6b8a6eab8b0ebab80e8808cec968debada1e9a3aeeab3b7eab2b1efa5b7ec8e88ec8aa3e795b0e3828fecabb3e68182e383abeab5a6e58484
UHC 渦기뫀而얍뭡飮곷겱亮쎈슣異わ쫳恂ル굦億 1110100010111110101100011110001010010001101001001110110010111011101111101110010110111001101111001110101111100110100000011110101110000001101111011110010110111001101111011110101110011010101011111110110010110110101010101110111110100110100010111110001011100001101010111110101110000010100011001110010111100010 e8beb1e291a4ecbbbee5b9bcebe681eb81bde5b9bdeb9aafecb6aaefa68be2e1abeb828ce5e2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
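
A table like the one above can be approximated offline with a few lines of Python. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-Win, `cp949` for UHC) are Python's own, the helper name `encode_table` is hypothetical, and replacing unencodable characters with "?" (byte 3f) is an assumption chosen because it matches the 3f bytes seen in the results.

```python
# Map the labels used on this page to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Characters that a charset cannot represent come out as 3f, which is why the ISO-8859-1 row consists entirely of that byte for non-Latin input.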