To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??意??碎??筌?????乙??搖??釗? 100010010101000100111111001111111000100011010011001111110011111111100001111010100011111100111111111000101010001100111111001111110011111100111111001111111000100110110011001111110011111110011101100010100011111100111111111110111011101100111111 89513f3f88d33f3fe1ea3f3fe2a33f3f3f3f3f89b33f3f9d8a3f3ffbbb3f
EUC-JP 渦??意??碎??筌??絪??乙??搖??釗? 101100011011001000111111001111111011000011010101001111110011111111100010111011000011111100111111111001001010010100111111001111111000111111010011111011000011111100111111101100101011010100111111001111111101100111101010001111110011111110001111111000111010011000111111 b1b23f3fb0d53f3fe2ec3f3fe4a53f3f8fd3ec3f3fb2b53f3fd9ea3f3f8fe3a63f
UTF-8 渦깅맧意㏝춳碎좊퉲筌앫굢絪뷸뇻乙녠텚搖깃퀣釗퀯 111001101011100010100110111010101011100110000101111010111010011110100111111001101000010010001111111000111000111110011101111011001011011010110011111001111010001010001110111011001010001010001010111011011000100110110010111001111010110110001100111011001001010110101011111010101011010110100010111001111011010110101010111010111011011110111000111010111000011110111011111001001011100110011001111010111000010110100000111011011000010110011010111001101001000010010110111010101011100110000011111011011000000010100011111010011000011110010111111011011000000010101111 e6b8a6eab985eba7a7e6848fe38f9decb6b3e7a28eeca28aed89b2e7ad8cec95abeab5a2e7b5aaebb7b8eb87bbe4b999eb85a0ed859ae69096eab983ed80a3e98797ed80af
UHC 渦깅맧意㏝춳碎좊퉲筌앫굢絪뷸뇻乙녠텚搖깃퀣釗퀯 11101000101111101011000111101011100100001011000011101011111100101010011111101001101011011000111111100001111011111010000011101011101110011000101011101111101001111001110111100011100000101000100111101100110111111011101011100110101101001010011111101011111000001011001111101010101101101001001111101000111101001011000111101010101100111001011111100001111100101011010001000010 e8beb1eb90b0ebf2a7e9ad8fe1efa0ebb98aefa79de38289ecdfbae6b4a7ebe0b3eab693e8f4b1eab397e1f2b442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)