To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??鬼筌??唯??源?????肉??碎?? 1110001010100011001111110011111110001011010100111110001010100011001111110011111110010111010000100011111100111111100011001011100100111111001111110011111100111111001111111001001111110111001111110011111111100001111010100011111100111111 e2a33f3f8b53e2a33f3f97423f3f8cb93f3f3f3f3f93f73f3fe1ea3f3f
EUC-JP 筌??鬼筌??唯??源?????肉??碎?? 1110010010100101001111110011111110110101101101001110010010100101001111110011111111001101101000110011111100111111101110001011101100111111001111110011111100111111001111111100011011111001001111110011111111100010111011000011111100111111 e4a53f3fb5b4e4a53f3fcda33f3fb8bb3f3f3f3f3fc6f93f3fe2ec3f3f
UTF-8 筌뗫툋鬼筌뗫툖唯뗰쭓源낆죰廬믩뗀肉듿㎤碎ㅼ젔 111001111010110110001100111010111001011110101011111011011000100010001011111010011010110010111100111001111010110110001100111010111001011110101011111011011000100010010110111001011001010010101111111010111001011110110000111011001010110110010011111001101011101010010000111010111000001010000110111011001010001110110000111011111010011010000010111010111010111110101001111010111001011110000000111010001000001010001001111010111001001110111111111000111000111010100100111001111010001010001110111000111000010110111100111011001010000010010100 e7ad8ceb97abed888be9acbce7ad8ceb97abed8896e594afeb97b0ecad93e6ba90eb8286eca3b0efa682ebafa9eb9780e88289eb93bfe38ea4e7a28ee385bceca094
UHC 筌뗫툋鬼筌뗫툖唯뗰쭓源낆죰廬믩뗀肉듿㎤碎ㅼ젔 1110111110100111100010111110101110111000100000111101000010100001111011111010011110001011111010111011100010001101111010101110011010001011111011111010011110001011111010101011100110000101111011001010000110001011111001011111111010010010111010111011011010111110111010111011111110001010111001011010011110101000111000011110111110100100111011001010000010010010 efa78bebb883d0a1efa78bebb88deae68befa78beab985eca18be5fe92ebb6beebbf8ae5a7a8e1efa4eca092

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)