To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 筌ъ?筌??閻?n}筌ъ?筌??閻?n{^ 1110001010100011100001001000110000111111111000101010001100111111001111111110100010000101001111110110111001111101111000101010001110000100100011000011111111100010101000110011111100111111111010001000010100111111011011100111101101011110 e2a3848c3fe2a33f3fe8853f6e7de2a3848c3fe2a33f3fe8853f6e7b5e
EUC-JP 筌ъ?筌??閻?n}筌ъ?筌??閻?n{^ 1110010010100101101001111110110000111111111001001010010100111111001111111110111111100101001111110110111001111101111001001010010110100111111011000011111111100100101001010011111100111111111011111110010100111111011011100111101101011110 e4a5a7ec3fe4a53f3fefe53f6e7de4a5a7ec3fe4a53f3fefe53f6e7b5e
UTF-8 筌ъ넑筌뀁뀥閻쯱n}筌ъ넑筌뀁뀥閻쯱n{^ 111001111010110110001100110100011000101011101011100001001001000111100111101011011000110011101011100000001000000111101011100000001010010111101001100101101011101111101100101011111011000101101110011111011110011110101101100011001101000110001010111010111000010010010001111001111010110110001100111010111000000010000001111010111000000010100101111010011001011010111011111011001010111110110001011011100111101101011110 e7ad8cd18aeb8491e7ad8ceb8081eb80a5e996bbecafb16e7de7ad8cd18aeb8491e7ad8ceb8081eb80a5e996bbecafb16e7b5e
UHC 筌ъ넑筌뀁뀥閻쯱n}筌ъ넑筌뀁뀥閻쯱n{^ 11101111101001111010110011101100100001101001110011101111101001111011001011101100100001011001110011100111101000101010100101101111011011100111110111101111101001111010110011101100100001101001110011101111101001111011001011101100100001011001110011100111101000101010100101101111011011100111101101011110 efa7acec869cefa7b2ec859ce7a2a96f6e7defa7acec869cefa7b2ec859ce7a2a96f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)