To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????矣??阿??誼???????臾?? 001111110011111100111111001111110011111100111111111000011110000100111111001111111000100010100010001111110011111110001011011000100011111100111111001111110011111100111111001111110011111111100100011010110011111100111111 3f3f3f3f3f3fe1e13f3f88a23f3f8b623f3f3f3f3f3f3fe46b3f3f
EUC-JP ???靷??矣??阿??誼????靷??臾?? 00111111001111110011111110001111111001111011110100111111001111111110001011100011001111110011111110110000101001000011111100111111101101011100001100111111001111110011111100111111100011111110011110111101001111110011111111100111110011000011111100111111 3f3f3f8fe7bd3f3fe2e33f3fb0a43f3fb5c33f3f3f3f8fe7bd3f3fe7cc3f3f
UTF-8 嶺뚮뿫靷숁꼮矣몄춷阿숇끃誼쁧嶺뚮뿫靷앮뇡臾뺤춷 111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111011001000100010000001111010101011110010101110111001111001111110100011111010111010101010000100111011001011011010110111111010011001100010111111111011001000100010000111111010111000000110000011111010001010101010111100111011001000000110100111111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111011001001010110101110111010111000011110100001111010001000011110111110111010111011101010100100111011001011011010110111 efa6abeb9aaeebbfabe99db7ec8881eabcaee79fa3ebaa84ecb6b7e998bfec8887eb8183e8aabcec81a7efa6abeb9aaeebbfabe99db7ec95aeeb87a1e887beebbaa4ecb6b7
UHC 嶺뚮뿫靷숁꼮矣몄춷阿숇끃誼쁧嶺뚮뿫靷앮뇡臾뺤춷 11100111101011011000110011101011100101111010101111101100111001101001100111100110100001001000100111101011111110001011100011101100101011011001001111100100101110011001100111101011100001011011100111101011111111101001100001101010111001111010110110001100111010111001011110101011111011001110011010011101111001101000011110001001111010111010110010010101111011001010110110010011 e7ad8ceb97abece699e68489ebf8b8ecad93e4b999eb85b9ebfe986ae7ad8ceb97abece69de68789ebac95ecad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)