To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??媛?????歪??諭g?怨k?雅 100010010101000100111111001111111001010101010001001111110011111100111111001111110011111110011000011000110011111100111111100101110100000010000010100001110011111110001001100001011000001010001011001111111000100111101011 89513f3f95513f3f3f3f3f98633f3f974082873f8985828b3f89eb
EUC-JP 渦??媛??洹??歪??諭g?怨k?雅 1011000110110010001111110011111111001001101100100011111100111111100011111100011110111010001111110011111111001111110001000011111100111111110011011010000110100011111001110011111110110001111001011010001111101011001111111011001011101101 b1b23f3fc9b23f3f8fc7ba3f3fcfc43f3fcda1a3e73fb1e5a3eb3fb2ed
UTF-8 渦기뫀媛뉒댆洹잆걶歪묅뫀諭g넭怨k쳛雅 111001101011100010100110111010101011100010110000111010111010101110000000111001011010101010011011111010111000100110010010111010111000110010000110111001101011010010111001111011001001111010000110111010101011000110110110111001101010110110101010111010111010110010000101111010111010101110000000111010001010101110101101111011111011110110000111111010111000010010101101111001101000000010101000111011111011110110001011111011001011001110011011111010011001101110000101 e6b8a6eab8b0ebab80e5aa9beb8992eb8c86e6b4b9ec9e86eab1b6e6adaaebac85ebab80e8abadefbd87eb84ade680a8efbd8becb39be99b85
UHC 渦기뫀媛뉒댆洹잆걶歪묅뫀諭g넭怨k쳛雅 1110100010111110101100011110001010010001101001001110101010110000100001111110011110001000101100001110101010110111100111111110001110000001100111001110100011100000100100011110001010010001101001001110101110110001101000111110011110000110101011001110101010110011101000111110101110101011100000011110010010111010 e8beb1e291a4eab087e788b0eab79fe3819ce8e091e291a4ebb1a3e786aceab3a3ebab81e4ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)